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Q (57) Abstract: The invention relates to nitrilases and to nucleic acids encoding the nitrilases. In addition methods of designing 
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MTRILASES, NUCLEIC ACIDS ENCODING THEM AND 
METHODS FOR MAKING AND USING THEM 



CROSS-REFERENCES TO RELATED APPLICATIONS 

[0001] This application claims the benefit of priority to U.S. Patent Application Serial 
5 No. (USSN) 10/241,742, filed September 9, 2002, and USSN 10/146,772, filed May 15, 
2002, which claims the benefit of priority to USSN 60/35 1,336, filed January 22, 2002, 
USSN 60/309, 006, filed July 30, 2001, and USSN 60/300,189, filed June 21, 2001; and is a 
continuation-in-part of USSN 09/751,299, filed December 28, 2000, which claims the benefit 
of priority to each of USSN 60/254,414, filed December 7, 2000, and USSN 60/173,609, 
10 filed December 29, 1999. These applications are hereby incorporated by reference into the 
subject application in their entireties for all purposes. 

COPYRIGHT NOTIFICATION 

[0002] Pursuant to 37 C.F.R. §1 .71(e), a portion of this patent document contains 
material which is subject to copyright protection. The copyright owner has no objection to 
15 the facsimile reproduction by anyone of the patent document or the patent disclosure, as it 
appears in the Patent and Trademark Office patent file or records, but otherwise reserves all 
copyright rights whatsoever. 

FIELD OF THE INVENTION 

[0003] The invention relates generally to the field of molecular biology, biochemistry and 
20 chemistry, and particularly to enzymatic proteins having nitrilase activity. The invention also 
relates to polynucleotides encoding the enzymes, and to uses of such polynucleotides and 
enzymes. 

BACKGROUND OF THE INVENTION 

[0004] There are naturally occurring enzymes which have great potential for use in 
25 industrial chemical processes for the conversion of nitriles to a wide range of useful products 
and intermediates. Such enzymes include nitrilases which are capable of converting nitriles 
directly to carboxylic acids. Nitrilase enzymes are found in a wide range of mesophilic 
micro-organisms, including species of Bacillus, Norcardia, Bacteridium, Rhodococcus, 
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micro-organisms, including species of Bacillus, Norcardia, Bacteridium, Rhodococcus, 
Micrococcus, Brevibacterium, Alcaligenes, Acinetobacter, Corynebacterium, Fusarium and 
Klebsiella. Additionally, there are thermophilic nitrilases which exist in bacteria. 
[0005] There are two major routes from a nitrile to an analogous acid: (1 ) a nitrilase 
5 catalyzes the direct hydrolysis of a nitrile to a carboxylic acid with the concomitant release of 
ammonia; or (2) a nitrile hydratase adds a molecule of water across the carbon-nitrogen 
bonding system to give the corresponding amide, which can then act as a substrate for an 
amidase enzyme which hydrolyzes the carbon-nitrogen bond to give the carboxylic acid 
product with the concomitant release of ammonia. The nitrilase enzyme therefore provides 

1 0 the more direct route to the acid. 

[0006] A nitrile group offers many advantages in devising synthetic routes in that it is 
often easily introduced into a molecular structure and can be carried through many processes 
as a masked acid or amide group. This is only of use, however, if the nitrile can be unmasked 
at the relevant step in the synthesis. Cyanide represents a widely applicable Ci-synthon 

15 (cyanide is one of the few water-stable carbanions) which can be employed for the synthesis 
of a carbon framework. However, further transformations of the nitrile thus obtained are 
impeded due to the harsh reaction conditions required for its hydrolysis using normal 
chemical synthesis procedures. The use of enzymes to catalyze the reactions of nitriles is 
attractive because nitrilase enzymes are able to effect reactions with fewer environmentally 

20 hazardous reagents and by-products than in many traditional chemical methods. Indeed, the 
chemoselective biocatalytic hydrolysis of nitriles represents a valuable alternative because it 
occurs at ambient temperature and near physiological pH. 

[0007] The importance of asymmetric organic synthesis in drug design and discovery has 
fueled the search for new synthetic methods and chiral precursors which can be utilized in 

25 developing complex molecules of biological interest. One important class of chiral 

molecules is the a-substituted carboxylic acids, which include the a-amino acids. These 
molecules have long been recognized as important chiral precursors to a wide variety of 
complex biologically active molecules, and a great deal of research effort has been dedicated 
to the development of methods for the synthesis of enantiomerically pure a-amino acids and 

30 chiral medicines. 

[0008] Of particular use to synthetic chemists who make chiral medicines would be an 
enzyme system which is useful under non-sterile conditions, which is usefiil in non-biological 
laboratories, which is available in a form convenient for storage and use; which has broad 
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substrate specificity, which acts on poorly water soluble substrates; which has predictable 
product structure; which provides a choice of acid or amide product; and which is capable of 
chiral differentiation. Accordingly, there is a need for efficient, inexpensive, high-yield 
synthetic methods for producing enantiomerically pure a-substituted carboxylic acids, such as, 
5 for example, ct-amino acids and a-hydroxy acids. 

[0009] In addition, often, the discovery or evolution of an enzyme to perform a particular 
transformation can be aided by the availability of a convenient high throughput screening or 
selection process. While a surrogate substrate may be used when an effective ultra high 
throughput (UHTP) screen is not available, it may be desirable to screen directly for an enzyme 

1 0 that performs specifically the desired transformation. The challenges of designing an UHTP 
screen is evident when, for example, the discovery or evolution program is aimed at uncovering 
a stereoselective transformation to generate only one stereoisomer or enantiomer. In this case, 
there is a paucity of high throughput screening methods available. While, the most 
straightforward method is to use chiral liquid or gas phase separation to separate the two 

1 5 enantiomers in question, often this approach does not afford the very high throughput capacity 
that is required. By using mass spectroscopy (MS) techniques, very high throughput screens are 
possible. However, when applied in a conventional manner, MS does not afford information on 
chirality or enantioselectivity. 

[0001 0] Another approach is to chemically derivatize the enantiomeric mixture with a single 
20 enantiomer compound, thus generating a diasteriomeric mixture of compounds that can be 

characterized by separation on an achiral stationary phase. Again, this is a cumbersome 

approach and does not lend itself well to high throughput screening. 

[00011] Throughout this application, various publications are referenced by author and 

date. The disclosures of these publications in their entireties are hereby incorporated by 
25 reference into this application in order to more fully describe the state of the art as known to 

those skilled therein as of the date of the invention described and claimed herein. 

SUMMARY OF THE INVENTION 

[00012] The present invention is directed to an isolated or recombinant nucleic acid 
comprising nucleotides having a sequence at least about 50% identical to SEQ ID NO: 1, 3, 
30 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 
59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 
107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 
143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 
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179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199,201,203,205, 207, 209,211,213, 
215, 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 247, 249, 
251, 253, 255, 257, 259, 261, 263, 265, 267, 269, 271, 273, 275, 277, 279, 281, 283, 285, 
287, 289, 291, 293, 295, 297, 299, 301, 303, 305, 307, 309, 311, 313, 315, 317, 319, 321, 
5 323, 325, 327, 329, 331, 333, 335, 337, 339, 341, 343, 345, 347, 349, 351, 353, 355, 357, 
359, 361, 363, 365, 367, 369, 371, 373, 375, 377, 379, 381, 383, 385, or variants thereof, 
wherein the nucleic acid encodes a polypeptide having a nitrilase activity. In alternative 
aspects of the invention, the nucleic acid comprises nucleotides having a sequence at least 
about 50%, 51%, 52%, 53%, 54%, 55%, 56%, 57%, 58%, 59%, 60%, 61%, 62%, 63%, 64%, 

1 0 65%, 66%, 67%, 68%, 69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 
81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 
97%, 98%, 99%, or more, or complete identity (100% identical) to the SEQ ID NO: or 
variants thereof. Exemplary variants may include, for example, the following variations of 
SEQ ID NO:195, 205, 207, 209, OR 237, having one or more mutations: at positions 163- 

15 165 AAA, AAG, GGT, GGC, GGA, GGG, CAA, or CAG; at positions 178-180 GAA or 
GAG; at positions 331-333 TCT, TCC, TCA, TCG, AGT, or AGC; at positions 568-570 
CAT, CAC, TCT, TCC, TCA, TCG, AGT, AGC, ACT, ACC, ACA, TCA, TAT, TAC, ATG 
or ACG; at positions 571-573 TTA, TTG, CTT, CTC, CTA, CTG, GTT, GTC, GTA, GTG, 
ATG, ACT, ACC, ACA, GAT, GAC, GGT, GGC, GGA, GGG, GAA, GAG, TAT, TAC, or 

>0 ACG; at positions 595-597 GAA, GAG, TTA, TTG, CTT, CTC, CTA, or CTG; at positions 
664-666 TTA, TTG, CTT, CTC, CTA, or CTG; or any combination thereof In one aspect of 
the invention, the variants encode a polypeptide having improved or diminished 
enantioselectivity, for example, in the conversion of a 3-hydroxyglutarylnitrile (HGN) to (R>- 
4-Cyano-3-hydroxybutyrate, than the polypeptide encoded by the SEQ ID NO. 

15 [00013] In one aspect of the invention, the nucleic acid comprises nucleotides having a 
sequence substantially identical to the SEQ ID NO: or variants thereof. In another aspect, the 
invention provides for an isolated or recombinant nucleic acid comprising consecutive 
nucleotides having a sequence at least 79 % identical to SEQ ID NO: 33, wherein the nucleic 
acid encodes a polypeptide having nitrilase activity. The invention provides for a fragment of 

i0 the nucleic acid, wherein the fragment encodes a polypeptide having nitrilase activity. The 
invention also provides for an isolated or recombinant nucleic acid complementary to any of 
the nucleic acids. The invention also provides for an isolated or recombinant nucleic acid 
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that hybridizes to any one of the nucleic acids under stringent conditions. In one aspect, the 
stringent conditions comprise at least 50% formamide, and about 37°C to about 42°C. 
[00014] The invention provides for a nucleic acid probe comprising from about 1 5 
nucleotides to about 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 

5 150, 200, 250, 300, 350, 400, 450, 500 or more nucleotides, wherein at least 10, 1 1, 12, 13, 
14, 15, 16, 17, 18, 19, 20 or more consecutive nucleotides are at least 50% complementary to 
a nucleic acid target region within a nucleic acid sequence of SEQ ID NO: 1, 3, 5, 7, 9, 1 1, 
13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 
65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 

10 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 
147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 
183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 209, 211, 213, 215, 217, 
219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 247, 249, 251, 253, 
255, 257, 259, 261, 263, 265, 267, 269, 271, 273, 275, 277, 279, 281, 283, 285, 287, 289, 

15 291, 293, 295, 297, 299, 301, 303, 305, 307, 309, 311, 313, 315, 317, 319, 321, 323, 325, 
327, 329, 331, 333, 335, 337, 339, 341, 343, 345, 347, 349, 351, 353, 355, 357, 359, 361, 
363, 365, 367, 369, 371, 373, 375, 377, 379, 381, 383, 385, variants thereof, or their 
complements. In one aspect, the nucleic acid probe comprises consecutive nucleotides which 
are at least 55% complementary to the nucleic acid target region. In one aspect, the invention 

20 provides for a nucleic acid probe, wherein the consecutive nucleotides are at least 50%, 51%, 
52%, 53%, 54%, 55%, 56%, 57%, 58%, 59%, 60%, 61%, 62%, 63%, 64%, 65%, 66%, 67%, 
68%, 69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 
84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 
or more or 100% complementary to the nucleic acid target region. In another aspect, the 

25 nucleic acid consists essentially of from about 20 to about 50 nucleotides. In other aspects, 
the nucleic acid can be at least about 20, 25, 30, 35, 40, 45, 50, 75, 100, 150 nucleotides in 
length. 

[00015] The invention provides for a nucleic acid vector capable of replication in a host 
cell, wherein the vector comprises the nucleic acid of the invention. The invention also 
30 provides for a host cell comprising the nucleic acid. The invention also provides for a host 
organism comprising the host cell. In one aspect, the host organism comprises a gram 
negative bacterium, a gram positive bacterium or a eukaryotic organism. In another aspect, 
the gram negative bacterium comprises Escherichia colU or Pseudomonas fluorescens. In a 

5 
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further aspect, the gram positive bacterium comprises Sireptomyces diversa, Lactobacillus 
gasseri, Lactococcus lactis, Lactococcus cremoris, or Bacillus subtilis. In a further aspect, 
the eukaryotic organism comprises Saccharomyces cerevisiae, Schizosaccharomyces pombe, 
Pichia pastoris, Kluyveromyces lactis y Hansenula plymorpha, or Aspergillus niger. 
5 [00016] The invention provides for an isolated or recombinant nucleic acid encoding a 
polypeptide comprising amino acids having a sequence at least 50% identical to SEQ ID 
NO:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 
52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 
100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 

10 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 
172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 
208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 
244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 278, 
280, 282, 284, 286, 288, 290, 292, 294, 296, 298, 300, 302, 304, 306, 308, 310, 312, 314, 

15 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 
352, 354, 356, 358, 360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 380, 382, 384, 386, or 
variants thereof, wherein the polypeptide has nitrilase activity. In one aspect, the polypeptide 
comprises amino acids having at least about 50%, 51%, 52%, 53%, 54%, 55%, 56%, 57%, 
58%, 59%, 60%, 61%, 62%, 63%, 64%, 65%, 66%, 67%, 68%, 69%, 70%, 71%, 72%, 73%, 

>0 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 
90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or more or 100% identity to the 
SEQ ID NO: or variants thereof. Exemplary variants may include, for example, the 
following variations of SEQ ID NO:196, 206, 208, 210 or 238, having one or more 
mutations: at residue 55 lysine, glycine, or glutamine; at residue 60 glutamic acid; at residue 

15 111 serine, at residue 190, serine, histidine, tyrosine or threonine; at residue 191, leucine, 
valine, methionine, aspartic acid, glycine, glutamic acid, tyrosine or threonine; at residue 199 
glutamic acid or leucine; at residue 222 leucine; or any combination thereof 
[00017] The invention also provides for an isolated or recombinant nucleic acid encoding a 
polypeptide comprising at least 10 consecutive amino acids having a sequence identical to a 

10 portion of an amino acid sequence of SEQ ID NO:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 
28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 
78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 
120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 
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156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 
192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 
228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 
264, 266, 268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 298, 
5 300, 302, 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 
336, 338, 340, 342, 344, 346, 348, 350, 352, 354, 356, 358, 360, 362, 364, 366, 368, 370, 
372, 374, 376, 378, 380, 382, 384, 386, or variants thereof. 

[00018] An isolated or recombinant polypeptide comprising amino acids having a 
sequence at least about 50% identical to SEQ ID NO:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 

10 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 
76,78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 
118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 
154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 
190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 

1 5 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 
262, 264, 266, 268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 
298, 300, 302, 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 
334, 336, 338, 340, 342, 344, 346, 348, 350, 352, 354, 356, 358, 360, 362, 364, 366, 368, 
370, 372, 374, 376, 378, 380, 382, 384, 386, or variants thereof, wherein the polypeptide has 

20 nitrilase activity. In one aspect of the invention, the polypeptide comprises amino acids 
having a sequence at least about 50%, 51%, 52%, 53%, 54%, 55%, 56%, 57%, 58%, 59%, 
60%, 61%, 62%, 63%, 64%, 65%, 66%, 67%, 68%, 69%, 70%, 71%, 72%, 73%, 74%, 75%, 
76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 
92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or more or 100% identical to the SEQ ID NO: 

15 or variants thereof. 

[00019] The invention provides an isolated or recombinant nucleic acid comprising 
nucleotides having a sequence as set forth in any one of the following SEQ ID NOS:l, 3, 5, 7, 
9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 
61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 

JO 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 
143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 
179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199,201,203,205,207,209,211,213, 
215, 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 247, 249, 



7 
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251, 253, 255, 257, 259, 261, 263, 265, 267, 269, 271, 273, 275, 277, 279, 281, 283, 285, 
287, 289, 291, 293, 295, 297, 299, 301, 303, 305, 307, 309, 311, 313, 315, 317, 319, 321, 
323, 325, 327, 329, 331, 333, 335, 337, 339, 341, 343, 345, 347, 349, 351, 353, 355, 357, 
359, 361, 363, 365, 367, 369, 371, 373, 375, 377, 379, 381, 383, 385, and variants thereof 
5 (hereinafter referred to as "Group A nucleic acids"). The invention is also directed to nucleic 
acids having specified minimum percentages of sequence identity to any of the Group A 
nucleic acids sequences. 

[00020] In another aspect, the invention provides an isolated (purified) or recombinant 
polypeptide comprising amino acid residues having a sequence as set forth in any one of the 

10 following SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 
42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 
92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 
130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 
166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 

15 202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 
238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 272, 
274, 276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 298, 300, 302, 304, 306, 308, 
310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 
346, 348, 350, 352, 354, 356, 358, 360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 380, 

20 382, 384, 386, and variants thereof, (hereinafter referred to as "Group B amino acid 
sequences"). The invention is also directed to purified polypeptides having specified 
minimum percentages of sequence identity to any of the Group B amino acid sequences. 
[00021] The invention provides for a fragment of the polypeptide which is at least 50 
amino acids in length, and wherein the fragment has nitrilase activity. Furthermore, the 

25 invention provides for a peptidomimetic of the polypeptide or a fragment thereof having 
nitrilase activity. The invention provides for a codon-optimized polypeptide or a fragment 
thereof, having nitrilase activity, wherein the codon usage is optimized for a particular 
organism or cell. Narum et al Infect. Jmmun. 2001 Dec, 69(12):7250-3 describes codon- 
optimzation in the mouse system. Outchkourov et al. Protein Expr. Purif. 2002 Feb; 

30 24(l):18-24 describes codon-optimization in the yeast system. Feng et al. Biochemistry 2000 
Dec 19, 39(50): 15399-409 describes codon-optimization in E. colt Humphreys et al. Protein 
Expr. Purif 2000 Nov, 20(2):252-64 describes how codon usage affects secretion in E. colt 

8 
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[00022] In one aspect, the organism or cell comprises a gram negative bacterium, a gram 
positive bacterium or a eukaryotic organism. In another aspect of the invention, the gram 
negative bacterium comprises Escherichia coli, or Pseudomonas fluorescens. In another 
aspect of the invention, the gram positive bacterium comprise Streptomyces diversa, 
5 Lactobacillus gasseri, Lactococcus lactis, Lactococcus cremoris, or Bacillus subtilis. In 
another aspect of the invention, the eukaryotic organism comprises Saccharomyces 
cerevisiae, Schizosaccharomyces pombe, Pichia pastoris, Kluyveromyces lactis, Hansenula 
plymorpha, or Aspergillus niger. 

[00023] In another aspect, the invention provides for a purified antibody that specifically 
1 0 binds to the polypeptide of the invention or a fragment thereof, having nitrilase activity. In 
one aspect, the invention provides for a fragment of the antibody that specifically binds to a 
polypeptide having nitrilase activity. 

[00024] The invention provides for an enzyme preparation which comprises at least one of 

the polypeptides of the invention, wherein the preparation is liquid or dry. The enzyme 
15 preparation includes a buffer, cofactor, or second or additional protein. In one aspect the 

preparation is affixed to a solid support. In one aspect of the invention, the solid support can 

be a gel, a resin, a polymer, a ceramic, a glass, a microelectrode and any combination thereof. 

In another aspect, the preparation can be encapsulated in a gel or a bead. 

[00025] The invention further provides for a composition which comprises at least one 
10 nucleic acid of the invention which comprises at least one polypeptide of the invention or a 

fragment thereof, or a peptidomimetic thereof, having nitrilase activity, or any combination 

thereof. 

[00026] The invention provides for a method for hydrolyzing a nitrile to a carboxylic acid 
comprising contacting the molecule with at least one polypeptide of the invention or a 
!5 fragment thereof, or a peptidomimetic thereof, having nitrilase activity, under conditions 
suitable for nitrilase activity. In one aspect, the conditions comprise aqueous conditions. In 
another aspect, the conditions comprise a pH of about 8.0 and/or a temperature from about 
37° C to about 45° C. 

[00027] The invention provides for a method for hydrolyzing a cyanohydrin moiety or an 
0 aminonitrile moiety of a molecule, the method comprising contacting the molecule with at 
least one polypeptide of the invention, or a fragment thereof, or a peptidomimetic thereof, 
having nitrilase activity, under conditions suitable for nitrilase activity. 
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[00028] The invention provides for a method for making a chiral a-hydroxy acid 
molecule, a chiral amino acid molecule, a chiral p-hydroxy acid molecule, or a chiral gamma- 
hydroxy acid molecule, the method comprising admixing a molecule having a cyanohydrin 
moiety or an aminonitrile moiety with at least one polypeptide having an amino acid 
5 sequence at least 50% identical to any one of the Group B amino acid sequences or a 

fragment thereof, or a peptidomimetic thereof, having enantio-selective nitrilase activity. In 
one aspect, the chiral molecule is an (X>enantiomer. In another aspect, the chiral molecule is 
an (^-enantiomer. In one aspect of the invention, one particular enzyme can have R- 
specificity for one particular substrate and the same enzyme can have S-specificity for a 

1 0 different particular substrate. 

[00029] The invention also provides for a method for making a composition or an 
intermediate thereof, the method comprising admixing a precursor of the composition or 
intermediate, wherein the precursor comprises a cyanohydrin moiety or an aminonitrile 
moiety, with at least one polypeptide of the invention or a fragment or peptidomimetic 

15 thereof having nitrilase activity, hydrolyzing the cyanohydrin or the aminonitrile moiety in 
the precursor thereby making the composition or the intermediate thereof In one aspect, the 
composition or intermediate thereof comprises f$-2-amino-4-phenyl butanoic acid. In a 
further aspect, the composition or intermediate thereof comprises an L-amino acid. In a 
further aspect, the composition comprises a food additive or a pharmaceutical drug. 

20 [00030] The invention provides for a method for making an (3?>ethyl 4-cyano-3- 

hydroxybutyric acid, the method comprising contacting a hydroxyglutaryl nitrile with at least 
one polypeptide having an amino acid sequence of the Group B amino acid sequences, or a 
fragment or peptidomimetic thereof having nitrilase activity that selectively produces an (R)- 
enantiomer, so as to make (2y-ethyl 4-cyano-3-hydroxybutyric acid. In one aspect, the ee is 

25 at least 95% or at least 99%. In another aspect, the hydroxyglutaryl nitrile comprises 1 ,3-di- 
cyano-2-hydroxy-propane or 3-hydroxyglutaronitrile. In a further aspect, the polypeptide has 
an amino acid sequence of any one of the Group B amino acid sequences, or a fragment or 
peptidomimetic thereof having nitrilase activity. 

[00031] The invention also provides a method for making an ($-ethyl 4-cyano-3- 
30 hydroxybutyric acid, the method comprising contacting a hydroxyglutaryl nitrile with at least 
one polypeptide having an amino acid sequence of the Group B amino acid sequences, or a 
fragment or peptidomimetic thereof having nitrilase activity that selectively produces an (S)- 
enantiomer, so as to make (S)-dhy\ 4-cyano-3-hydroxybutyric acid. 

10 
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[00032] The invention provides a method for making an (X>mandelic acid, the method 
comprising admixing a mandelonitrile with at least one polypeptide having an amino acid 
sequence of any one of the Group B amino acid sequences or any fragment or 
peptidomimetic thereof having appropriate nitrilase activity. In one aspect, the (2?>mandelic 
5 acid comprises (X>2-chloromandelic acid. In another aspect, the (2?>mandelic acid 

comprises an aromatic ring substitution in the ortho-, meta- y or para- positions; a 1-naphthyl 
derivative of (K>mandelic acid, a pyridyl derivative of (K>mandelic acid or a thienyl 
derivative of (2y-mandelic acid or a combination thereof, 

[00033] The invention provides a method for making an (S>mandelic acid, the method 
1 0 comprising admixing a mandelonitrile with at least one polypeptide having an amino acid 
sequence of Group B sequences or any fragment or peptidomimetic thereof having nitrilase 
activity. In one aspect, the (S>mandelic acid comprises (S>methyl benzyl cyanide and the 
mandelonitrile comprises (S>methoxy-benzyl cyanide. In one aspect, the (2>>mandelic acid 
comprises an aromatic ring substitution in the ortho-, meta-, or para- positions; a 1-naphthyl 
15 derivative of (^-mandelic acid, a pyridyl derivative of (S>mandelic acid or a thienyl 
derivative of (S^-mandelic acid or a combination thereof. 

[00034] The invention also provides a method for making an (5?-phenyl lactic acid 
derivative or an (S^-phenyllactic acid derivative, the method comprising admixing a 
phenyllactonitrile with at least one polypeptide selected from the group of the Group B amino • 
20 acid sequences or any active fragment or peptidomimetic thereof that selectively produces an 
(S>enantiomer or an (Ty-enantiomer, thereby producing an $>>phenyl lactic acid derivative 
or an (R)-pheuy\ lactic acid derivative. 

[00035] The invention provides for a method for making the polypeptide of the invention 
or a fragment thereof, the method comprising (a) introducing a nucleic acid encoding the 

25 polypeptide into a host cell under conditions that permit production of the polypeptide by the 
* host cell, and (b) recovering the polypeptide so produced. 
[00036] The invention provides for a method for generating a nucleic acid variant 
encoding a polypeptide having nitrilase activity, wherein the variant has an altered biological 
activity from that which naturally occurs, the method comprising (a) modifying the nucleic 

30 acid by (i) substituting one or more nucleotides for a different nucleotide, wherein the 
nucleotide comprises a natural or non-natural nucleotide; (ii) deleting one or more 
nucleotides, (iii) adding one or more nucleotides, or (iv) any combination thereof, hi one 
aspect, the non-natural nucleotide comprises inosine. In another aspect, the method further 
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comprises assaying the polypeptides encoded by the modified nucleic acids for altered 
nitrilase activity, thereby identifying the modified nucleic acid(s) encoding a polypeptide 
having altered nitrilase activity. In one aspect, the modifications of step (a) are made by 
PCR, error-prone PCR, shuffling, oligonucleotide-directed mutagenesis, assembly PCR, 
5 sexual PCR mutagenesis, in vivo mutagenesis, cassette mutagenesis, recursive ensemble 
mutagenesis, exponential ensemble mutagenesis, site-specific mutagenesis, gene reassembly, 
gene site saturated mutagenesis, ligase chain reaction, in vitro mutagenesis, ligase chain 
reaction, oligonucleotide synthesis, any DNA-generating technique and any combination 
thereof. In another aspect, the method further comprises at least one repetition of the 

1 0 modification step (a). 

[00037] The invention further provides a method for making a polynucleotide from two or 
more nucleic acids, the method comprising: (a) identifying regions of identity and regions of 
diversity between two or more nucleic acids, wherein at least one of the nucleic acids 
comprises a nucleic acid of the invention; (b) providing a set of oligonucleotides which 

1 5 correspond in sequence to at least two of the two or more nucleic acids; and, (c) extending the 
oligonucleotides with a polymerase, thereby making the polynucleotide. 
[00038J The invention further provides a screening assay for identifying a nitrilase, the 
assay comprising: (a) providing a plurality of nucleic acids or polypeptides comprising at least 
one of the nucleic acids of the invention, or at least one of the polypeptides of the invention; (b) 

20 obtaining polypeptide candidates to be tested for nitrilase activity from the plurality; (c) 

testing the candidates for nitrilase activity; and (d) identifying those polypeptide candidates 
which are nitrilases. In one aspect, the method further comprises modifying at least one of 
the nucleic acids or polypeptides prior to testing the candidates for nitrilase activity. In 
another aspect, the testing of step (c) further comprises testing for improved expression of the 

25 polypeptide in a host cell or host organism. In a further aspect, the testing of step (c) further 
comprises testing for nitrilase activity within a pH range from about pH 3 to about pH 12. In 
a further aspect, the testing of step (c) further comprises testing for nitrilase activity within a 
pH range from about pH 5 to about pH 10. In another aspect, the testing of step (c) further 
comprises testing for nitrilase activity within a temperature range from about 4 °C to about 80 

JO °C. In another aspect, the testing of step (c) further comprises testing for nitrilase activity 
within a temperature range from about 4 °C to about 55 °C. In another aspect, the testing of 
step (c) further comprises testing for nitrilase activity which results in an enantioselective 

12 
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reaction product. In another aspect, the testing of step (c ) further testing for nitrilase activity 
which results in a regio-selective reaction product. 

[00039] The invention provides for use of the nucleic acids of the invention, or a fragment 
or peptidomimetic thereof having nitrilase activity, in a process designed to optimize one 

5 aspect of the gene or one aspect of the polypeptide encoded by the gene. In one aspect, the 
process comprises introducing modifications into the nucleotide sequence of the nucleic acid. 
In another aspect, the modifications are introduced by PCR, error-prone PGR, shuffling, 
oligonucleotide-directed mutagenesis, assembly PCR, sexual PCR mutagenesis, in vivo 
mutagenesis, cassette mutagenesis, recursive ensemble mutagenesis, exponential ensemble 

0 mutagenesis, site-specific mutagenesis, gene reassembly, gene site saturated mutagenesis, 
ligase chain reaction, in vitro mutagenesis, ligase chain reaction, oligonucleotide synthesis, 
any other DNA-generating technique or any combination thereof. In a further aspect, the 
process is repeated. 

[00040] The invention provides for use of the polypeptide of the invention, or a fragment 
5 or peptidomimetic thereof having nitrilase activity, in an industrial process. In one aspect, 
the process is for production of a pharmaceutical composition, the process is for production 
of a chemical, the process is for production of a food additive, the process is catalyzing the 
breakdown of waste, or the process is production of a drug intermediate. In a further aspect, 
the process comprises use of the polypeptide to hydrolyze a hydroxyglutarylnitrile substrate. 
0 In a further aspect, the process is for production of LIPITOR™. In another aspect, the 
polypeptide used comprises a polypeptide having consecutive amino acids of the sequence 
SEQ ID NO:44, 196, 208, 210, or 238 or a fragment thereof having nitrilase activity. In 
another aspect, the process is production of a detergent. In another aspect, the process is 
production of a food product. 
5 [00041] The invention provides for use of a nucleic acid of the invention, or a fragment 
thereof encoding a polypeptide having nitrilase activity, in the preparation of a transgenic 
organism. 

[00042] The invention provides for a kit comprising (a) the nucleic acid of the inventions, 
or a fragment thereof encoding a polypeptide having nitrilase activity, or (b) the polypeptide 
0 of the invention, or a fragment or a peptidomimetic thereof having nitrilase activity, or a 
combination thereof; and (c) a buffer. 

[00043] The invention provides for a method for modifying a molecule comprising: (a) 
mixing a polypeptide of the invention or a fragment or peptidomimetic thereof having 
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nitrilase activity, with a starting molecule to produce a reaction mixture; (b) reacting the 
starting molecule with the polypeptide to produce the modified molecule. 
[00044] The invention provides for a method for identifying a modified compound 
comprising: (a) admixing a polypeptide of the invention, or a fragment or peptidomimetic 
5 thereof having nitrilase activity, with a starting compound to produce a reaction mixture and 
thereafter a library of modified starting compounds; (b) testing the library to determine 
whether a modified starting compound is present within the library which exhibits a desired 
activity; (c) identifying the modified compound exhibiting the desired activity. 
[00045] The invention provides a screening assay for enantioselective transformation 

10 comprising: (a) providing a molecule having two prochiral or enantiotopic moieties; (b) 

labeling at least one prochiral or enantiotopic moiety of the molecule; (b) modifiying at least 
one of the two moieties by a selective catalyst; and (c) detecting results by mass 
spectroscopy. The screening assay can be used to determine or monitor the % enantiomeric 
excess (ee) or determine the % diasteromeric excess (de). An exemplary label useful in the 

15 assay is a heavier isotope or a liter isotope. The selective catalyst useful in the assay can be 
an enzyme. The screening assay may be performed with both moieties labeled. The 
screening assay may be performed in both directions, Le., from the reactants to the products 
as well as from the products to the reactants. 

[00046] The invention provides for a computer readable medium having stored thereon a 
20 nucleic acid of the invention, e.g., a nucleic acid comprising at least one nucleotide sequence 
selected from the group consisting of: SEQ ID NO: 1, 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 
27, 29, 31, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 
79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 
121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 
25 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 
193, 195, 197, 199, 201, 203, 205, 207, 209, 211, 213, 215, 217, 219, 221, 223, 225, 227, 
229, 231, 233, 235, 237, 239, 241, 243, 245, 247, 249, 251, 253, 255, 257, 259, 261, 263, 
265, 267, 269, 271, 273, 275, 277, 279, 281, 283, 285, 287, 289, 291, 293, 295, 297, 299, 
301, 303, 305, 307, 309, 311, 313, 315, 317, 319, 321, 323, 325, 327, 329, 331, 333, 335, 
JO 337, 339, 341, 343, 345, 347, 349, 351, 353, 355, 357, 359, 361, 363, 365, 367, 369, 371, 
373, 375, 377, 379, 381, 383, 385, and variants thereof, and/or at least one amino acid 
sequence selected from the group consisting of: SEQ ID NO:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 
22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 
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72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 1 12, 114, 
116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 
152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 
188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 
5 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 258, 
260, 262, 264, 266, 268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 
296, 298, 300, 302, 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 
332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 352, 354, 356, 358, 360, 362, 364, 366, 
368, 370, 372, 374, 376, 378, 380, 382, 384, 386, and variants thereof. 

1 0 [00047] The invention provides for a computer system comprising a processor and a data 
storage device, wherein the data storage device has stored thereon a nucleic acid of the 
invention, e.g., a nucleic acid comprising at least one nucleotide sequence selected from the 
group consisting of SEQ ID NO: 1, 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 35, 
37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 

15 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 
127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 
163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 
199, 201, 203, 205, 207, 209, 211, 213, 215, 217, 219, 221, 223, 225, 227, 229, 231, 233, 
235, 237, 239, 241, 243, 245, 247, 249, 251, 253, 255, 257, 259, 261, 263, 265, 267, 269, 

20 271, 273, 275, 277, 279, 281, 283, 285, 287, 289, 291, 293, 295, 297, 299, 301, 303, 305, 
307, 309, 311, 313, 315, 317, 319, 321, 323, 325, 327, 329, 331, 333, 335, 337, 339, 341, 
343, 345, 347, 349, 351, 353, 355, 357, 359, 361, 363, 365, 367, 369, 371, 373, 375, 377, 
379, 381, 383, 385, and variants thereof, and/or at least one amino acid sequence selected 
from the group consisting of: SEQ ID NO:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 

25 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 
82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 
124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 
160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 
196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 

30 232, 234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 
268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 298, 300, 302, 
304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 
340, 342, 344, 346, 348, 350, 352, 354, 356, 358, 360, 362, 364, 366, 368, 370, 372, 374, 

15 
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376, 378, 380, 382, 384, 386, and variants thereof. In one aspect, the computer system 
further comprises a sequence comparison algorithm and a data storage device having at least 
one reference sequence stored thereon. In another aspect, the sequence comparison algorithm 
comprises a computer program that identifies polymorphisms. 
5 [00048] The invention provides for a method for identifying a feature in a sequence which 
comprises: (a) inputting the sequence into a computer; (b) running a sequence feature 
identification program on the computer so as to identify a feature within the sequence; and (c) 
identifying the feature in the sequence, wherein the sequence comprises a nucleic acid of the 
invention, e.g., a nucleic acid comprising at least one of SEQ ID NOS:l-386, its variants, or 

1 0 any combination thereof. 

[00049] The invention provides for an assay for identifying a functional fragment of a 
polypeptide which comprises: (a) obtaining a fragment of at least one polypeptide of the 
invention; (b) contacting at least one fragment from step (a) with a substrate having a 
cyanohydrin moiety or an aminonitrile moiety under reaction conditions suitable for nitrilase 

1 5 activity; (c) measuring the amount of reaction product produced by each at least one fragment 
from step (b); and (d) identifying the at least one fragment which is capable of producing a 
nitrilase reaction product; thereby identifying a functional fragment of the polypeptide. In 
one aspect, the fragment of step (a) is obtained by synthesizing the fragment. In another 
aspect, the fragment of step (a) is obtained by fragmenting the polypeptides. 

20 [00050] The invention provides for an assay for identifying a functional variant of a 

polypeptide which comprises: (a) obtaining at least one variant of at least one polypeptide of 
the invention; (b) contacting at least one variant from step (a) with a substrate having a 
cyanohydrin moiety or an aminonitrile moiety under reaction conditions suitable for nitrilase 
activity; (c) measuring the amount of reaction product produced by each at least one variant 

25 from step (b); and (d) identifying the at least one variant which is capable of producing a 
nitrilase reaction product; thereby identifying a functional variant of the polypeptide. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[00051] Figure 1 shows chemical reaction schemes wherein stereoselective nitrilases 
hydrolyze a cyanohydrin or an aminonitrile to produce a chiral a-hydroxy acid or a-amino 
30 acid. 

[00052] Figure 2 illustrates an OPA based cyanide detection assay used for identifying the 
presence of nitrilase activity. 
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[00053] Figure 3 is an illustration of a spectroscopic system for the detection and 
quantification of ct-hydroxy acids based on stereoselective lactate dehydrogenases. 
[00054] Figure 4 is an illustration of a spectroscopic system for the detection and 
quantification of a-amino acids based on stereoselective amino acid oxidase. 

5 [00055] Figure 5 is a flow diagram illustrating the steps of a nitrilase screening method. 
[00056] Figures 6A-6E are chromatograms characteristic of the substrate and product 
combination for D-phenylglycine showing a blank sample (Fig. 6A), an enzymatic reaction 
sample (Fig. 6B); a negative control consisting of cell lysate in buffer (Fig. 6C); a chiral 
analysis of phenylglycine (Fig. 6D); and coelution of the nitrile peak with the D-enantiomer 

10 (Fig. 6E). 

[00057] Figures 7A-7E illustrate chromatograms which are characteristic of substrate and 
product combinations for (i?)-2-chloromandelic acid. Fig. 7A shows only 2- 
chloromandelonitrile in buffer; Fig. 7B shows a cloromandelic acid standard. The 
chromatogram in Fig. 7C shows the appearance of product and the reduction of substrate 
15 peaks. 

[00058] Figures 8A-8B illustrate chromatograms characteristic of substrate and product 
combinations for (iS)-phenyllactic acid. 

[00059] Figures 9 A-9B illustrate chromatograms characteristic of substrate and product 
combinations for L-2-methylphenylglycine. 
20 [00060] Figures 10A-1 0C illustrate chromatograms characteristic of substrate and product 
combinations for L-terf-leucine. 

[00061] Figures 1 1 A-l 1C illustrate chromatograms characteristic of substrate and product 
combinations for (S)-2-amino-6-hydroxy hexanoic acid. 

[00062] Figures 12A-12D illustrate chromatograms characteristic of substrate and product 
25 combinations for 4-methyl-D-leucine and 4-methyl-L-leucine. 

[00063] Figures 1 3 A-13B illustrate chromatograms characteristic of substrate and product 
combinations for (iS)-cyclohexylmandelic acid. 

[00064] Figures 14A-14B illustrate two exemplary standard curves for quantitation in 
connection with the screening assay of the invention. 
30 [00065] Figure 15 illustrates selected compounds that can be produced from a nitrilase- 
catalyzed reaction using an enzyme and/or a method of the invention. 
[00066] Figure 16 illustrates selected compounds that can be produced from a nitrilase- 
catalyzed reaction using an enzyme and/or a method of the invention. 

17 
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[000671 Figure 17 illustrates an exemplary nitrilase reaction of the invention. 

DETAILED DESCRIPTION OF THE INVENTION 

[00068] The present invention relates to nitrilases, nucleic acids encoding nitrilases, and 
5 uses therefor. As used herein, the term "nitrilase" encompasses any polypeptide having any 
nitrilase activity, for example, the ability to hydrolyze nitriles into their corresponding 
carboxylic acids and ammonia. Nitrilases have commercial utility as biocatalysts for use in 
the synthesis of enantioselective aromatic and aliphatic amino acids or hydroxy acids. 



R 2 / Rl nitrilase R 2 ^ 1 + Ft, ? 2 

Rr \N Rf %N R ^COOH 

[00069] Nitrilase chemistry is as follows: 



10 [00070] A nitrilase reaction for the preparation of hydroxy acids is as follows: 



nitrilase 





COOH COOH 
[00071] A nitrilase reaction for the preparation of amino acids is as follows: 
NH 2 _ NH 2 NH 2 

v ► y - v 

CN COOH N COOH 



15 [00072] In addition, in each of the foregoing hydrolysis reactions, two water molecules are 
consumed and one ammonia molecule is released. 

[00073] There are several different types of assays which can be performed to test for the 
presence of nitrilase activity in a sample or to test whether a particular polypeptide exhibits 
nitrilase activity. For example, assays can detect the presence or absence of products or by- 
20 products from a chemical reaction catalyzed by a nitrilase. For example, the presence of 
nitrilase activity can be detected by the production of a-hydroxy acids or a-amino acids 
from, respectively, cyanohydrins or aminonitriles, and the level of nitrilase activity can be 
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quantified by measuring the relative quantities of the reaction products produced. Figure 1 
shows chemical reaction schemes using stereoselective nitrilases to create chiral a-hydroxy 
acids or a-amino acids in high yield. The starting material is an aldehyde or an imine which 
is produced from an aldehyde by reaction with ammonia. Reaction of the aldehyde or imine 
5 with hydrogen cyanide results in the production of enantiomeric mixtures of the 

corresponding cyanohydrins and aminonitriles. A stereoselective nitrilase can then be used to 
stereoselectively convert one enantiomer into the corresponding a-hydroxy acid or a-amino 
acid. Figure 3 illustrates schematically the stereoselective nitrilase-dependent production and 
spectrophotometric detection of a-hydroxy acids based on lactate dehydrogenase conversion 

10 of the a-hydroxy acids to the corresponding a-keto acids and concomitant oxidation- 
reduction of a detectable dye. Figure 4 illustrates schematically the stereoselective nitrilase- 
dependent production and spectrophotometric detection of a-amino acids based on amino 
acid oxidase conversion of the a-amino acids to the corresponding a-keto acids and 
concomitant oxidation-reduction of a detectable dye. 

1 5 [00074] Nitrilases contemplated for use in the practice of the present invention include 
those which stereoselectively hydrolyze nitriles or cyanohydrins into their corresponding 
acids and ammonia. In one aspect, nitrilases of the invention can stereoselectively hydrolyze 
nitriles or cyanohydrins into their corresponding acids and ammonia. Nitrilases include, for 
example, nitrilases of the invention, e.g., those set forth in the Group B amino acid 

20 sequences. Some nitrilases which stereoselectively hydrolyze their substrates are set forth in 
the Tables hereinbelow. 

[00075] The nitrilases of the invention share the following additional characteristics: 

(1) full-length amino acid sequences from about 333 amino acids to about 366 amino acids, 

(2) aggregation and activity as homo-multimers of about 2 subunits to about 16 subunits, 
25 (3) presence of a catalytic triad of the consecutive amino acids Glu-Lys-Cys, (4) pH optima 

from about pH 5 to about pH 9, and (5) temperature optima from about 0°C to about 100°C, 
or from about 40°C to about 50°C. 

Consensus Sequences Among New Nitrilases 
[00076] The nitrilases disclosed herein were studied using bioinformatics and sequence 
30 comparison programs and the following consensus information was collected. Three regions 
of conserved motifs were identified within the nitrilase polypeptides. These correspond to 
the catalytic triad (E-K-C) present in nitrilase enzymes. (H. Pace and C. Brenner (Jan. 1 5, 
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2001) "The Nitrilase Superfamily: classification, structure and function" Genome Biology 
Vol. 2, No. l,ppl-9.) 

[00077] The abbreviations used herein are conventional one letter codes for the amino 
acids: A, alanine; B, asparagine or aspartic acid; C, cysteine; D aspartic acid; E, glutamate, 
5 glutamic acid; F, phenylalanine; G, glycine; H histidine; I isoleucine; K, lysine; L, leucine; 
M, methionine; N, asparagine; P, proline; Q, glutamine; R, arginine; S, serine; T, threonine; 
V, valine; W, tryptophan; Y, tyrosine; Z, glutamine or glutamic acid. See L. Stryer, 
Biochemistry, 1988, W. H. Freeman and Company, New York. 

[00078] The computer sequence comparisons made among the nitrilase polypeptide 
10 sequences of the invention resulted in the identification of these motifs within each amino 
acid sequence: 
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[00079] The following residues (those that are underlined) are completely conserved 
among all of the identified nitrilases: the third amino acid in the first motif or region (E, 
1 5 glutamate); the second residue in the second motif (R, arginine); the third residue in the 
second motif (K, lysine); the third residue in the third motif (C, cysteine); and the fifth 
residue in the third motif (E, glutamate). 

[00080] In the boxes, upper case letters indicate 90% or greater consensus among the 
nitrilases of the invention, while lower case letters indicate 50% or greater consensus. An 
20 italicized letter indicates 30% or greater consensus among the nitrilases of the invention. A 
dot in a box indicates a residue which is not conserved. 

[00081] The sequences of nitrilases in the nitrilase branch of the nitrilase superfamily 
were described as having a catalytic triad in the Pace and Brenner article (Genome Biology, 
2001, Vol. 2, No. 1, pp. 1-9). However, the catalytic triad regions of the nitrilases of this 
25 invention differ from those previously identified in the Pace and Brenner reference in the 
following ways: 

[00082] Differences in the first motif : The F in the first box of the first motif is 
conserved in 90% of the nitrilases of the invention, rather than in only 50% of those 
previously identified. The fourth residue of the first motif is a "t", threonine in the nitrilases 
30 of this invention, and it is found at 50% or greater consensus. However, that residue was 
identified by Pace and Brenner as "a" (alanine). The last residue of the first motif was 
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identified as "f ' (phenylalanine) and was indicated to occur at 50% or greater consensus. 
However, the nitrilases of this invention only show "f ' (phenylalanine occurring at 30% 
consensus. 

[00083] Differences in the second motif: There is an "r" (arginine) in the first box of 

5 the second motif of the nitrilases of this invention. However, the Pace and Brenner 

consensus shows an "h M (histidine) in that position. The M R" (arginine) in the second box is 
completely conserved in the nitrilases of the present invention, however that residue only 
appears at 90% consensus in the Pace and Brenner reference. The "L" (leucine) in the fourth 
box of the second motif is conserved in 90% or more of the nitrilases of this invention. 

10 However, the Pace and Brenner nitrilases only showed conservation of that residue in 50% of 
the sequences. Similarly, the "P" (proline) at the sixth box of the second motif is conserved 
in 90% or more of the nitrilases of this invention. However, the Pace and Brenner nitrilases 
only showed conservation of that residue in 50% of the sequences. 
[00084] Differences in the third motif: The "L in the first box is conserved at 90% or 

15 greater in the nitrilases of the invention. However, the Pace and Brenner reference only 

shows that residue appearing 50% of the time. Finally, the sixth box in the third motif in the 
nitrilases of the invention show a histidine 50% of the time or more. However, the Pace and 
Brenner reference indicates that that position shows an asparagine (V) 50% of the time. 
[00085] The invention provides for an isolated polypeptide having nitrilase activity which 

20 polypeptide comprises three regions, wherein the first region comprises five amino acids and 
wherein the first amino acid of the first region is F and the fourth amino acid of the first 
region is T. The invention also provides for an isolated polypeptide having nitrilase activity 
which polypeptide comprises three regions, wherein the second region comprises seven 
amino acids and wherein the first amino acid of the second region is R, wherein the second 

25 amino acid of the second region is R, and wherein the sixth amino acid of the second region 
is P. The invention also provides for an isolated polypeptide having nitrilase activity which 
polypeptide comprises three regions, wherein the third region comprises nine amino acids 
and wherein the first amino acid of the third region is L and the sixth amino acid of the third 
region is H. 

30 [00086] The invention also provides for an isolated polypeptide having nitrilase activity 
which polypeptide comprises three consenus subsequences, wherein the first consensus 
subsequence is FPETF, wherein the second consensus subsequence is RRKLXPT, and 
wherein the third consensus subsequence is LXCWEHXXP. 
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[00087] The invention also provides for an isolated polypeptide having nitrilase activity 
which polypeptide comprises three consenus subsequences, wherein the first consensus 
subsequence is FPEXX, wherein the second consensus subsequence is XRKLXPT, and 
wherein the third consensus subsequence is LXCWEXXXP. 

[00088] In accordance with the present invention, methods are provided for producing 
enantiomerically pure a-substituted carboxylic acids. The enantiomerically pure a- 
substituted carboxylic acids produced by the methods of the present invention have the 
following structure: 



R] * R 2 and Ri and R 2 are otherwise independently -H, substituted or unsubstituted 
alkyl, alkenyl, alkynyl, aryl, heteroaryl, cycloalkyl, or heterocyclic, wherein said substituents 
are lower alkyl, hydroxy, alkoxy, amino, mercapto, cycloalkyl, heterocyclic, aryl, heteroaryl, 
aryloxy, or halogen or optionally R\ and R2 are directly or indirectly covalently joined to form 
a functional cyclic moiety, and E is -N(R X ) 2 or -OH, wherein each R x is independently -H or 
lower alkyl. 

[00089] As used herein, the term "alkyl" refers to straight or branched chain or cyclic 
hydrocarbon groups of from 1 to 24 carbon atoms, including methyl, ethyl, n-propyl, 
isopropyl, n-butyl, isobutyl, tert-butyl, n-pentyl, n-hexyl, and the like. The term "lower 
alkyl'* refers to monovalent straight or branched chain or cyclic radicals of from one to about 
six carbon atoms. 

[00090] As used herein, "alkenyl" refers to straight or branched chain or cyclic 
hydrocarbon groups having one or more carbon-carbon double bonds, and having in the range 
of about 2 to about 24 carbon atoms. . 

[00091] As used herein, "alkynyl" refers to straight or branched chain or cyclic 
hydrocarbon groups having at least one carbon-carbon triple bond, and having in the range of 
about 2 to about 24 carbon atoms. 

[00092] As used herein, "cycloalkyl" refers to cyclic hydrocarbon groups containing in the 
range of about 3 to about 14 carbon atoms. 




wherein: 
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[00093] As used herein, "heterocyclic" refers to cyclic groups containing one or more 
heteroatoms (e.g., N, O, S, P, Se, B, etc.) as part of the ring structure, and having in the range 
of about 3 to about 14 carbon atoms. 

[00094] As used herein, "aryl" refers to aromatic groups (i.e., cyclic groups with 
5 conjugated double-bond systems) having in the range of about 6 to about 14 carbon atoms. 
[00095] As used herein with respect to a chemical group or moiety, the term "substituted" 
refers to such a group or moiety further bearing one or more non-hydrogen substituents. 
Examples of such substituents include, without limitation, oxy (e.g., in a ketone, aldehyde, 
ether, or ester), hydroxy, alkoxy (of a lower alkyl group), amino, thio, mercapto (of a lower 
10 alkyl group), cycloalkyl, substituted cycloalkyl, heterocyclic, substituted heterocyclic, aryl, 
substituted aryl, heteroaryl, substituted heteroaryl, aryloxy, substituted aryloxy, halogen, 
trifluoromethyl, cyano, nitro, nitrone, amino, amido, -C(0)H, acyl, oxyacyl, carboxyl, 
carbamate, sulfonyl, sulfonamide, sulfuryl, and the like. 

[00096] In preferred aspects, the enantiomerically pure a-substituted carboxylic acid 
1 5 produced by the methods of the present invention is an a-amino acid or a-hydroxy acid, hi 
some aspects, the enantiomerically pure a-amino acid is D-phenylalanine, D-phenylglycine, 
L-methylphenylglycine, L~terf-leucine, D-alanine, or D-hydroxynorleucine ((S)-2-amino-6- 
hydroxy hexanoic acid), R-pantolactone, 2-chloromandelic acid, or (S)- or (ii)-mandelic acid 
and the enantiomerically pure a-hydroxy acid is (5)-cyclohexylmandelic acid. As used 
10 herein, a "small molecule" encompasses any molecule having a molecular weight from at 
least 25 Daltons, 

[00097] The term "about" is used herein to mean approximately, roughly, around, or in the 
region of. When the term "about" is used in conjunction with a numerical range, it modifies 
that range by extending the boundaries above and below the numerical values set forth. In 
15 general, the term "about" is used herein to modify a numerical value above and below the 
stated value by a variance of 20 percent up or down (higher or lower). 
[00098] As used herein, the word "or" means any one member of a particular list and also 
includes any combination of members of that list. 

[00099] The phrase "nucleic acid" as used herein refers to a naturally occurring or 
(0 synthetic oligonucleotide or polynucleotide, whether DNA or RNA or DNA-RNA hybrid, 
single-stranded or double-stranded, sense or antisense, which is capable of hybridization to a 
complementary nucleic acid by Watson-Crick base-pairing. Nucleic acids of the invention 
can also include nucleotide analogs (e.g., BrdU), and non-phosphodiester internucleoside 
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linkages (e.g., peptide nucleic acid (PNA) or thiodiester linkages). In particular, nucleic acids 
can include, without limitation, DNA, RNA, cDNA, gDNA, ssDNA or dsDNA or any 
combination thereof. In some aspects, a "nucleic acid" of the invention includes, for 
example, a nucleic acid encoding a polypeptide as set forth in the Group B amino acid 
5 sequences, and variants thereof. The phrase "a nucleic acid sequence" as used herein refers to 
a consecutive list of abbreviations, letters, characters or words, which represent nucleotides. 
In one aspect, a nucleic acid can be a "probe" which is a relatively short nucleic acid, usually 
less than 100 nucleotides in length. Often a nucleic acid probe is from about 50 nucleotides 
in length to about 10 nucleotides in length. A "target region" of a nucleic acid is a portion of 

10 a nucleic acid that is identified to be of interest. 

[000100] A "coding region" of a nucleic acid is the portion of the nucleic acid which is 
transcribed and translated in a sequence-specific manner to produce into a particular 
polypeptide or protein when placed under the control of appropriate regulatory sequences. 
The coding region is said to encode such a polypeptide or protein. 

1 5 [0001 01 ] The term "gene" refers to a coding region operably j oined to appropriate 

regulatory sequences capable of regulating the expression of the polypeptide in some manner. 
A gene includes untranslated regulatory regions of DNA (e.g., promoters, enhancers, 
repressors, etc.) preceding (upstream) and following (downstream) the coding region (open 
reading frame, ORF) as well as, where applicable, intervening sequences (i.e., introns) 

20 between individual coding regions (i.e., exons). 

[000102] "Polypeptide" as used herein refers to any peptide, oligopeptide, polypeptide, 
gene product, expression product, or protein. A polypeptide is comprised of consecutive 
amino acids. The term "polypeptide" encompasses naturally occurring or synthetic 
molecules. 

15 [000103] In addition, as used herein, the term "polypeptide" refers to amino acids joined to 
each other by peptide bonds or modified peptide bonds, e.g., peptide isosteres, and may 
contain modified amino acids other than the 20 gene-encoded amino acids. The polypeptides 
can be modified by either natural processes, such as post-translational processing, or by 
chemical modification techniques which are well known in the art. Modifications can occur 

50 anywhere in the polypeptide, including the peptide backbone, the amino acid side-chains and 
the amino or carboxyl termini. It will be appreciated that the same type of modification can 
be present in the same or varying degrees at several sites in a given polypeptide. Also a 
given polypeptide can have many types of modifications. Modifications include, without 
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limitation, acetylation, acylation, ADP-ribosylation, amidation, covalent cross-linking or 
cyclization, covalent attachment of flavin, covalent attachment of a heme moiety, covalent 
attachment of a nucleotide or nucleotide derivative, covalent attachment of a lipid or lipid 
derivative, covalent attachment of a phosphytidylinositol, disulfide bond formation, 
5 demethylation, formation of cysteine or pyroglutamate, formylation, gamma-carboxylation, 
glycosylation, GPI anchor formation, hydroxylation, iodination, methylation, myristolyation, 
oxidation, pergylation, proteolytic processing, phosphorylation, prenylation, racemization, 
selenoylation, sulfation, and transfer-RNA mediated addition of amino acids to protein such 
as arginylation. (See Proteins - Structure and Molecular Properties 2nd Ed., T.E. Creighton, 
10 W.H. Freeman and Company, New York (1993); Postradiational Covalent Modification of 
Proteins, B.C. Johnson, Ed., Academic Press, New York, pp. 1-12 (1983)). 
[000104] As used herein, the term "amino acid sequence" refers to a list of abbreviations, 
letters, characters or words representing amino acid residues. 

[000105] As used herein, the term "isolated" means that a material has been removed from 

1 5 its original environment. For example, a naturally-occurring polynucleotide or polypeptide 
present in a living animal is not isolated, but the same polynucleotide or polypeptide, 
separated from some or all of the coexisting materials in the natural system, is isolated. Such 
polynucleotides can be part of a vector and/or such polynucleotides or polypeptides could be 
part of a composition, and would be isolated in that such a vector or composition is not part 

20 of its original environment. 

[000106] As used herein with respect to nucleic acids, the term "recombinant" means that 
the nucleic acid is covalently joined and adjacent to a nucleic acid to which it is not adjacent 
in its natural environment. Additionally, as used herein with respect to a particular nucleic 
acid in a population of nucleic acids, the term "enriched" means that the nucleic acid 

25 represents 5% or more of the number of nucleic acids in the population of molecules. 

Typically, the enriched nucleic acids represent 15% or more of the number of nucleic acids in 
the population of molecules. More typically, the enriched nucleic acids represent 50%, 90% 
or more of the number of nucleic acids in the population molecules. 
[000107] "Recombinant" polypeptides or proteins refer to polypeptides or proteins 

30 produced by recombinant DNA techniques, i.e. 9 produced from cells transformed by an 
exogenous recombinant DNA construct encoding the desired polypeptide or protein. 
"Synthetic" polypeptides or proteins are those prepared by chemical synthesis (e.g., solid- 
phase peptide synthesis). Chemical peptide synthesis is well known in the art (see, e.g., 
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Menifield (1963), Am. Chem. Soc. 85:2149-2154; Geysen et al. (1984), Proc. Natl Acad. 
Sci. f USA 81:3998) and synthesis kits and automated peptide synthesizer are commercially 
available (e.g., Cambridge Research Biochemicals, Cleveland, United Kingdom; Model 431 A 
synthesizer from Applied Biosystems, Inc., Foster City, CA). Such equipment provides ready 
5 access to the peptides of the invention, either by direct synthesis or by synthesis of a series of 
fragments that can be coupled using other known techniques. 

[000108] As used herein with respect to pairs of nucleic acid or amino acid sequences, 
"identity" refers to the extent to which the two sequences are invariant at positions within the 
sequence which can be aligned. The percent identity between two given sequences can be 

10 calculated using an algorithm such as BLAST (Altschui et al. (1990), J. Mol Biol 215:403- 
410). Seewww.ncbi.iilm.nih.gov/Education/BLASTinfo. When using the BLAST algorithm 
for sequences no longer than 250 nucleotides or about 80 amino acids ("short queries"), the 
search parameters can be as follows: the filter is off, the scoring matrix is PAM30, the word 
size is 3 or 2, the E value is 1000 or more, and the gap costs are 1 1, 1. For sequences longer 

1 5 than 250 nucleotides or 80 amino acid residues, the default search parameters can be used. 
The BLAST website provides advice for special circumstances which is to be followed in 
such circumstances. 

[000109] As used herein, "homology" has the same meaning as "identity" in the context of 
nucleotide sequences. However, with respect to amino acid sequences, "homology" includes 
20 the percentage of identical and conservative amino acid substitutions. Percentages of 

homology can be calculated according to the algorithms of Smith and Waterman ( 1 98 1 ), Adv. 
Appl. Math. 2:482. 

[000110] As used herein in the context of two or more nucleic acid sequences, two 
sequences are "substantially identical" when they have at least 99.5% nucleotide identity, 

25 when compared and aligned for maximum correspondence, as measured using the known 
sequence comparison algorithms described above. In addition, for purposes of determining 
whether sequences are substantially identical, synonymous codons in a coding region may be 
treated as identical to account for the degeneracy of the genetic code. Typically, the region 
for determination of substantial identity must span at least about 20 residues, and most 

30 commonly the sequences are substantially identical over at least about 25-200 residues. 
[000111] As used herein in the context of two or more amino acid sequences, two 
sequences are "substantially identical" when they have at least 99.5% identity, when 
compared and aligned for maximum correspondence, as measured using the known sequence 
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comparison algorithms described above. In addition, for purposes of determining whether 
sequences are substantially identical, conservative amino acid substitutions may be treated as 
identical if the polypeptide substantially retains its biological function. 
[000112] "Hybridization" refers to the process by which a nucleic acid strand joins with a 
complementary strand through hydrogen bonding at complementary bases. Hybridization 
assays can be sensitive and selective so that a particular sequence of interest can be identified 
even in samples in which it is present at low concentrations. Stringent conditions are defined 
by concentrations of salt or forrnamide in the prehybridization and hybridization solutions, or 
by the hybridization temperature, and are well known in the art. Stringency can be increased 
by reducing the concentration of salt, increasing the concentration of forrnamide, or raising 
the hybridization temperature. In particular, as used herein, "stringent hybridization 
conditions" include 42°C in 50% forrnamide, 5X SSPE, 0.3% SDS, and 200 ng/ml sheared 
and denatured salmon sperm DNA, and equivalents thereof. Variations on the above ranges 
and conditions are well known in the art. 

[000113] The term "variant" refers to polynucleotides or polypeptides of the invention 
modified at one or more nucleotides or amino acid residues (respectively) and wherein the 
encoded polypeptide or polypeptide retains nitrilase activity. Variants can be produced by 
any number of means including, for example, error-prone PCR, shuffling, oligonucleotide- 
directed mutagenesis, assembly PCR, sexual PCR mutagenesis, in vivo mutagenesis, cassette 
mutagenesis, recursive ensemble mutagenesis, exponential ensemble mutagenesis, site- 
specific mutagenesis, gene reassembly, gene site-saturated mutagenesis or any combination 
thereof. 

[000114] Methods of making peptidomimetics based upon a known sequence is described, 
for example, in U.S. Patent Nos. 5,631,280; 5,612,895; and 5,579,250. Use of 
peptidomimetics can involve the incorporation of a non-amino acid residue with non-amide 
linkages at a given position. One aspect of the present invention is a peptidomimetic wherein 
the compound has a bond, a peptide backbone or an amino acid component replaced with a 
suitable mimic. Examples of unnatural amino acids which may be suitable amino acid 
mimics include P-alanine, L-a-amino butyric acid, L-y-amino butyric acid, L-ct-amino 
isobutyric acid, L-e-amino caproic acid, 7-amino heptanoic acid, L-aspartic acid, L-glutamic 
acid, N-e-Boc-N-a-CBZ-L-lysine, N-e-Boc-N-a-Fmoc-L-lysine, L-methionine sulfone, L- 
norleucine, L-norvaline, N-a-Boc-N-5CBZ-L-ornithine, N-5-Boc-N-a-CBZ-L-ornithine, 
Boc-p-nitro-L-phenylalanine, Boc-hydroxyproline, Boc-L-thioproline. 
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[0001 15] As used herein, "small molecule" encompasses a molecule having a molecular 
weight from about 20 daltons to about 1.5 kilodaltons. 

[0001 1 6] The molecular biological techniques, such as subcloning, were performed 
using routine methods which would be well known to one of skill in the art. (Sambrook, J . 
Fritsch, EF, Maniatis, T. (1989) Molecular Cloning: A Laboratory Manual (2nd ed.), Cold 
Spring Harbor Laboratory Press, Plainview NY.). 

Com puter Systems 

[000117] In one aspect of the invention, any nucleic acid sequence and/or polypeptide 
sequence of the invention can be stored, recorded, and manipulated on any medium which 
can be read and accessed by a computer. As used herein, the words "recorded" and "stored" 
refer to a process for storing information on a computer medium. Another aspect of the 
invention is a computer readable medium having recorded thereon at least 2, 5, 10, 15 or 20 
nucleic acid sequences as set forth in SEQ ID NOS: 1-386, and sequences substantially 
identical thereto. In a further aspect, another aspect is the comparison among and between 
nucleic acid sequences or polypeptide sequences of the invention and the comparison 
between sequences of the invention and other sequences by a computer. Computer readable 
media include magnetically readable media, optically readable media, electronically readable 
media and magnetic/optical media. For example, the computer readable media may be a hard 
disk, a floppy disk, a magnetic tape, CD-ROM, Digital Versatile Disk (DVD), Random 
Access Memory (RAM), or Read Only Memory (ROM) as well as other types of other media 
known to those skilled in the art. 

[000118] Aspects of the invention include systems (eg., internet based systems), 
particularly computer systems which store and manipulate the sequence information 
described herein. As used herein, "a computer system" Tefers to the hardware components, 
software components, and data storage components used to analyze a sequence (either 
nucleic acid or polypeptide) as set forth in at least any one of SEQ ID NOS: 1-386 and 
sequences substantially identical thereto. The computer system typically includes a processor 
for processing, accessing and manipulating the sequence data. The processor can be any 
well-known type of central processing unit, such as, for example, the Pentium HI from Intel 
Corporation, or similar processor from Sun, Motorola, Compaq, AMD or International 
Business Machines. 
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[000119] Typically the computer system is a general purpose system that comprises the 
processor and one or more internal data storage components for storing data, and one or more 
data retrieving devices for retrieving the data stored on the data storage components. 

[000120] In one particular aspect, the computer system includes a processor connected to a 
5 bus which is connected to a main memory (preferably implemented as RAM) and one or 

more internal data storage devices, such as a hard drive and/or other computer readable media 
having data recorded thereon. In some aspects, the computer system further includes one or 
more data retrieving device for reading the data stored on the internal data storage devices. 

[000121] The data retrieving device may represent, for example, a floppy disk drive, a 
10 compact disk drive, a magnetic tape drive, or a modem capable of connection to a remote 
data storage system (eg., via the internet) etc. In some aspects, the internal data storage 
device is a removable computer readable medium such as a floppy disk, a compact disk, a 
magnetic tape, etc. containing control logic and/or data recorded thereon. The computer 
system may advantageously include or be programmed by appropriate software for reading 
15 the control logic and/or the data from the data storage component once inserted in the data 
retrieving device. 

[000122] The computer system includes a display which is used to display output to a 
computer user. It should also be noted that the computer system can be linked to other 
computer systems in a network or wide area network to provide centralized access to the 
20 computer system. In some aspects, the computer system may further comprise a sequence 
comparison algorithm. A "sequence comparison algorithm" refers to one or more programs 
which are implemented (locally or remotely) on the computer system to compare a nucleotide 
sequence with other nucleotide sequences and/or compounds stored within a data storage 
means. 

25 Uses of Nitrilases 

[000123] Nitrilases have been identified as key enzymes for the production of chiral a- 
hydroxy acids, which are valuable intermediates in the fine chemicals industry, and as 
pharmaceutical intermediates. The nitrilase enzymes of the invention are useful to catalyze 
the stereoselective hydrolysis of cyanohydrins and aminonitriles, producing chiral a- 

30 hydroxy- and a-amino acids, respectively. 
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[000124] Stereoselective enzymes provide a key advantage over chemical resolution 
methods as they do not require harsh conditions and are more environmentally compatible. 
The use of nitrilases is of particular interest for the production of chiral amino acids and a- 
hydroxy acids. Using a stereoselective nitrilase, dynamic resolution conditions can be 
5 established, due to the racemisation of the substrate under aqueous conditions. Thus 1 00% 
theoretical yields are achievable. 

[000125] This invention is directed to the nitrilases which have been discovered and 
isolated from naturally occurring sources. This invention is also directed to evolving novel 
genes and gene pathways from diverse and extreme environmental sources. In an effort to 

10 develop the most extensive assortment of enzymes available, DNA was extracted directly 
from samples that have been collected from varying habitats around the globe. From these 
efforts, the largest collection of environmental genetic libraries in the world was developed. 
Through extensive high-throughput screening of these libraries, 192 new sequence-unique 
nitrilase enzymes have been discovered to date. Previous to this invention, fewer than 20 

1 5 microbial-derived nitrilases had been reported in the literature and public databases. 

[000126] Biocatalysts, such as nitrilases, play an important role in catalyzing metabolic 
reactions in living organisms. In addition, biocatalysts have found applications in the 
chemical industry, where they can perform many different reactions. Some examples of the 
advantages of the use of nitrilases is that they provide: high enantio-, chemo- and regio- 

20 selectivity; they function under mild reaction conditions; they provide direct access to 
products - with minimal protection; they have high catalytic efficiencies; they produce 
reduced waste compared with the chemical alternatives; they are easily immobilized as 
enzymes or cells; they are recoverable, recyclable and are capable of being manipulated via 
molecular biological techniques; they can be regenerated in whole cell processes; they are 

25 tolerant to organic solvents; and importantly, they can be evolved or optimized. Optimized 
nitrilases are presented herein as working examples of the invention. 

[000127] Nitrilases catalyze the hydrolysis of nitrile moieties generating the corresponding 
carboxylic acid. Conventional chemical hydrolysis of nitriles requires strong acid or base and 
high temperature. However, one advantage of the invention is that nitrilases are provided 
30 which perform this reaction under mild conditions. Wide ranges of nitrile substrates can be 
transformed by nitrilases with high enantio-, chemo- and regio- selectivity. 
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R-C=N + 2H 2 0 nitri ' aSe ■ ? + NH 3 
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Table 1 - Some characteristics of Nitrilases of the Invention 



Previously Discovered New Nitrilases 
Nitrilases 

Limitations New Features Benefits 


<• zu reponea 


^>1 Rft npwlv discovered 


Access to a wider 
substrate range 


Homologous 


Unique nitrilases, many 
with little homology to 
previously known nitrilases 


Narrow substrate 
Activity spectrum 


Broad substrate 
activity spectrum 


Very few shown to be 
enantioselective 


Enantioselective; both 
enantiomers accessible 


Product with high 
enantiomeric excess and 
minimal waste production 


Limited stability profile 


Stable in a variety of 
conditions 


Potential use in a wide range 
of process conditions 


Inconsistent supply 


Consistent supply 


Reliable source of product 


Not applicable 


Amenable to optimization 


Good source material leads to 
better product 



5 



[000128] Dynamic Kine tic Resolution: The use of the nitrilases allows discrimination 
between two rapidly equilibrating enantiomers to give a single product in 100% theoretical 
yield. Nitrilases are utilized for dynamic resolution of key cyanohydrins and aminonitriles to 
produce enantiomerically pure a-carboxylic and a-amino acids. Newly discovered nitrilases 
10 disclosed herein yield products with >95% enantiomeric excess (ee) with and >95% yield. 
The nitrilases perform this transformation efficiently under mild conditions in aqueous 
solution or in the presence of organic solvent. 
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[000129] These products shown above also include the opposite enanatiomers, although 
they are not shown. In one aspect, the invention provides an isolated nucleic acid having a 
5 sequence as set forth in any one of the Group A nucleic acid sequences, having a sequence 
substantially identical thereto, or having a sequence complementary thereto. 

[000130] In another aspect, the invention provides an isolated nucleic acid including at least 
20 consecutive nucleotides identical to a portion of a nucleotide sequence as set forth in the 
Group A nucleic acid sequences, having a sequence substantially identical thereto, or having 
10 a sequence complementary thereto. 

[000131] In another aspect, the invention provides an isolated nucleic acid encoding a 
polypeptide having a sequence as set forth in the Group B amino acid sequences, or having a 
sequence substantially identical thereto. 

[000132] In another aspect, the invention provides an isolated nucleic acid encoding a 
1 5 polypeptide having at least 1 0 consecutive amino acids identical to a portion of a sequence as 
set forth in the Group B amino acid sequences, or having a sequence substantially identical 
thereto. 

[000133] In yet another aspect, the invention provides a substantially purified polypeptide 
comprising consecutive amino acid residues having a sequence as set forth in the Group B 
20 amino acid sequences, ot having a sequence substantially identical thereto. 
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[000134] In another aspect, the invention provides an isolated antibody that specifically 
binds to a polypeptide of the invention. The invention also provides for a fragment of the 
antibody which retains the ability to specifically bind the polypeptide. 

[000135] In another aspect, the invention provides a method of producing a polypeptide 
5 having a sequence as set forth in the Group B amino acid sequences, and sequences 

substantially identical thereto. The method includes introducing a nucleic acid encoding the 
polypeptide into a host cell, wherein the nucleic acid is operably joined to a promoter, and 
culturing the host cell under conditions that allow expression of the nucleic acid. 

[000136] In another aspect, the invention provides a method of producing a polypeptide 
10 having at least 10 consecutive amino acids from a sequence as set forth in the Group B amino 
acid sequences, and sequences substantially identical thereto. The method includes 
introducing a nucleic acid encoding the polypeptide into a host cell, wherein the nucleic acid 
is operably joined to a promoter, and culturing the host cell under conditions that allow 
expression of the nucleic acid, thereby producing the polypeptide. 

1 5 [000137] In another aspect, the invention provides a method of generating a variant of a 
nitrilase, including choosing a nucleic acid sequence as set forth in the Group A nucleic acid 
sequences, and changing one or more nucleotides in the sequence to another nucleotide, 
deleting one or more nucleotides in the sequence, or adding one or more nucleotides to the 
sequence. 

10 [000138] In another aspect, the invention provides assays for identifying functional variants 
of the Group B amino acid sequences that retain the enzymatic function of the polypeptides 
of the Group B amino acid sequences. The assays include contacting a polypeptide 
comprising consecutive amino acid residues having a sequence identical to a sequence of the 
Group B amino acid sequences or a portion thereof, having a sequence substantially identical 

15 to a sequence of the Group B amino acid sequences or a portion thereof, or having a sequence 
which is a variant of a sequence of the Group B amino acid sequences that retains nitrilase 
activity, with a substrate molecule under conditions which allow the polypeptide to function, 
and detecting either a decrease in the level of substrate or an increase in the level of a specific 
reaction product of the reaction between the polypeptide and the substrate, thereby 

►0 identifying a functional variant of such sequences. 
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Modification of Polypeptides of the Invention 

[000139] Enzymes are highly selective catalysts. Their hallmark is the ability to catalyze 
reactions with exquisite stereo-selectivity, regio-selectivity, and chemo-selectivity that is 

5 unparalleled in conventional synthetic chemistry. Moreover, enzymes are remarkably 
versatile. They can be tailored to function in organic solvents, operate at extreme pHs (for 
example, acidic or basic conditions) extreme temperatures (for example, high temperatures 
and low temperatures), extreme salinity levels (for example, high salinity and low salinity), 
and catalyze reactions with compounds that can be structurally unrelated to their natural, 

10 physiological substrates except for the enzymatic active site. 

[000140] The invention provides methods for modifying polypeptides having nitrilase 
activity or polynucleotides encoding such polypeptides in order to obtain new polypeptides 
which retain nitrilase activity but which are improved with respect to some desired 
characteristic. Such improvements can include the ability to function (z.e., exhibit nitrilase 
15 activity) in organic solvents, operate at extreme or uncharacteristic pHs, operate at extreme or 
uncharacteristic temperatures, operate at extreme or uncharacteristic salinity levels, catalyze 
reactions with different substrates, etc. 

[000141] The present invention directed to methods of using nitrilases so as to exploit the 
unique catalytic properties of these enzymes. Whereas the use of biocatalysts (i.e., purified or 

20 crude enzymes) in chemical transformations normally requires the identification of a 

particular biocatalyst that reacts with a specific starting compound, the present invention uses 
selected biocatalysts and reaction conditions that are specific for functional groups that are 
present in many starting compounds. Each biocatalyst is specific for one functional group, or 
several related functional groups, and can react with many starting compounds containing 

25 this functional group. 

[000142] Enzymes react at specific sites within a starting compound without affecting the 
rest of the molecule, a process which is very difficult to achieve using traditional chemical 
methods. This high degree of specificity provides the means to identify a single active 
compound within a library of compounds. The library is characterized by the series of 
30 biocatalytic reactions used to produce it, a so-called "biosynthetic history." Screening the 
library for biological activities and tracing the biosynthetic history identifies the specific 
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reaction sequence producing the active compound. The reaction sequence is repeated and the 
structure of the synthesized compound determined. This mode of identification, unlike other 
synthesis and screening approaches, does not require immobilization technologies, and 
compounds can be synthesized and tested free in solution using virtually any type of 
5 screening assay. It is important to note, that the high degree of specificity of enzyme 

reactions on functional groups allows for the "tracking" of specific enzymatic reactions that 
make up the biocatalytically produced library. (For further teachings on modification of 
molecules, including small molecules, see PCT Application No. PCT/US94/09174, herein 
incorporated by reference in its entirety). 

1 0 [000143] In one exemplification, the invention provides for the chimerization of a family of 
related nitrilase genes and their encoded family of related products. Thus according to this 
aspect of the invention, the sequences of a plurality of nitrilase nucleic acids (e.g., the Group 
A nucleic acids) serve as nitrilase "templates" which are aligned using a sequence comparison 
algorithm such as those described above. One or more demarcation points are then identified 

15 in the aligned template sequences, which are located at one or more areas of homology. The 
demarcation points can be used to delineate the boundaries of nucleic acid building blocks, 
which are used to generate chimeric nitrilases. Thus, the demarcation points identified and 
selected in the nitrilase template molecules serve as potential chimerization points in the 
assembly of the chimeric nitrilase molecules. 

20 [000144] Typically, a useful demarcation point is an area of local identity between at least 
two progenitor templates, but preferably the demarcation point is an area of identity that is 
shared by at least half of the templates, at least two thirds of the templates, at least three 
fourths of the templates, or at nearly all of the templates. 

[000145] The building blocks, which are defined by the demarcation points, can then be 
25 mixed (either literally, in solution, or theoretically, on paper or in a computer) and 

reassembled to form chimeric nitrilase genes. In one aspect, the gene reassembly process is 
performed exhaustively in order to generate an exhaustive library of all possible 
combinations. In other words, all possible ordered combinations of the nucleic acid building 
blocks are represented in the set of finalized chimeric nucleic acid molecules. At the same 
30 time, however, the order of assembly of each building block in the 5 1 to 3' direction in each 
combination is designed to reflect the order in the templates, and to reduce the production of 
unwanted, inoperative products. 
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[000146] In some aspects, the gene reassembly process is performed systematically, in 
order to generate a compartmentalized library with compartments that can be screened 
systematically, e.g., one by one. In other words, the invention provides that, through the 
selective and judicious use of specific nucleic acid building blocks, coupled with the selective 
5 and judicious use of sequentially stepped assembly reactions, an experimental design can be 
achieved where specific sets of chimeric products are made in each of several reaction 
vessels. This allows a systematic examination and screening procedure to be performed. 
Thus, it allows a potentially very large number of chimeric molecules to be examined 
systematically in smaller groups. 

1 0 [000147] In some aspects, the synthetic nature of the step in which the building blocks are 
generated or reassembled allows the design and introduction of sequences of nucleotides 
(e.g 9 codons or introns or regulatory sequences) that can later be optionally removed in an in 
vitro process (e.g., by mutagenesis) or in an in vivo process {e.g., by utilizing the gene 
splicing ability of a host organism). The introduction of these nucleotides may be desirable 

1 5 for many reasons, including the potential benefit of creating a useful demarcation point. 

[000148] The synthetic gene reassembly method of the invention utilizes a plurality of 
nucleic acid building blocks, each of which has two ligatable ends. Some examples of the 
two ligatable ends on each nucleic acid building block includes, but are not limited to, two 
blunt ends, or one blunt end and one overhang, or two overhangs. In a further, non-limiting 
20 example, the overhang can include one base pair, 2 base pairs, 3 base pairs, 4 base pairs or 
more. 

[000149] A double-stranded nucleic acid building block can be of variable size. Preferred 
sizes for building blocks range from about 1 base pair (bp) (not including any overhangs) to 
about 100,000 base pairs (not including any overhangs). Other preferred size ranges are also 
25 provided, which have lower limits of from about 1 bp to about 10,000 bp (including every 

integer value in between), and upper limits of from about 2 bp to about 100,000 bp (including 
every integer value in between). 

[000150] According to one aspect, a double-stranded nucleic acid building block is 
generated by first generating two single stranded nucleic acids and allowing them to anneal to 
30 form a double-stranded nucleic acid building block. The two strands of a double-stranded 
nucleic acid building block may be complementary at every nucleotide apart from any that 
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form an overhang; thus containing no mismatches, apart from any overhang(s). 
Alternatively, the two strands of a double-stranded nucleic acid building block can be 
complementary at fewer than every nucleotide, apart from any overhang(s). In particular, 
mismatches between the strands can be used to introduce codon degeneracy using methods 
5 such as the site-saturation mutagenesis described herein. 

[000151] In vivo shuffling of molecules is also useful in providing variants and can be 
performed utilizing the natural property of cells to recombine multimers. While 
recombination in vivo has provided the major natural route to molecular diversity, genetic 
recombination remains a relatively complex process that involves (1) the recognition of 
10 homologies; (2) strand cleavage, strand invasion, and metabolic steps leading to the 

production of recombinant chiasma; and finally (3) the resolution of chiasma into discrete 
recombined molecules. The formation of the chiasma requires the recognition of 
homologous sequences. 

[000152] Thus, the invention includes a method for producing a chimeric or recombinant 
1 5 polynucleotide from at least a first polynucleotide and a second polynucleotide in vivo. The 
invention can be used to produce a recombinant polynucleotide by introducing at least a first 
polynucleotide and a second polynucleotide which share at least one region of partial 
sequence homology (e.g., the Group A nucleic acid sequences, and combinations thereof) 
into a suitable host cell. The regions of partial sequence homology promote processes which 
20 result in sequence reorganization producing a recombinant polynucleotide. Such hybrid 
polynucleotides can result from intermolecular recombination events which promote 
sequence integration between DNA molecules. In addition, such hybrid polynucleotides can 
result from intramolecular reductive reassortment processes which utilize repeated sequences 
to alter a nucleotide sequence within a DNA molecule. 

25 [000153] The invention provides a means for generating recombinant polynucleotides 

which encode biologically active variant polypeptides (e.g., a nitrilase variant). For example, 
a polynucleotide may encode a particular enzyme from one microorganism. An enzyme 
encoded by a first polynucleotide from one organism can, for example, function effectively 
under a particular environmental condition, e.g., high salinity. An enzyme encoded by a 

JO second polynucleotide from a different organism can function effectively under a different 

environmental condition, such as extremely high temperature. A recombined polynucleotide 

containing sequences from the first and second original polynucleotides encodes a variant 

37 



WO 03/097810 PCT/US03/15712 
enzyme which exhibits characteristics of both enzymes encoded by the original 
polynucleotides. Thus, the enzyme encoded by the recombined polynucleotide can function 
effectively under environmental conditions shared by each of the enzymes encoded by the 
first and second polynucleotides, e.g., high salinity and extreme temperatures. 

5 [000154J A variant polypeptide can exhibit specialized enzyme activity not displayed in the 
original enzymes. For example, following recombination and/or reductive reassortment of 
polynucleotides encoding nitrilase activity, the resulting variant polypeptide encoded by a 
recombined polynucleotide can be screened for specialized nitrilase activity obtained from 
each of the original enzymes, i.e., the temperature or pH at which the nitrilase functions. 

10 Sources of the original polynucleotides may be isolated from individual organisms 

("isolates"), collections of organisms that have been grown in defined media ("enrichment 
cultures''), or, uncultivated organisms ("environmental samples"). The use of a culture- 
independent approach to derive polynucleotides encoding novel bioactivities from 
environmental samples is most preferable since it allows one to access untapped resources of 

1 5 biodiversity. The microorganisms from which the polynucleotide may be prepared include 
prokaryotic microorganisms, such asXanthobacter, Eubacteria andArchaebacteria, and 
lower eukaryotic microorganisms such as fungi, some algae and protozoa. Polynucleotides 
may be isolated from environmental samples in which case the nucleic acid may be recovered 
without culturing of an organism or recovered from one or more cultured organisms. In one 

20 aspect, such microorganisms may be extremophiles, such as hyperthermophiles, 

psychrophiles, psychrotrophs, halophiles, barophiles and acidophiles. Polynucleotides 
encoding enzymes isolated from extremophilic microorganisms are particularly preferred. 
Such enzymes may function at temperatures above 100°C in terrestrial hot springs and deep 
sea thermal vents, at temperatures below 0°C in arctic waters, in the saturated salt 

25 environment of the Dead Sea, at pH values around 0 in coal deposits and geothermal sulfur- 
rich springs, or at pH values greater than 1 1 in sewage sludge. 

[000155] Examples of mammalian expression systems that can be employed to express 
recombinant proteins include the COS-7, CI 27, 3T3, CHO, HeLa and BHK cell lines. 
Mammalian expression vectors comprise an origin of replication, a suitable promoter and • 
30 enhancer, and also any necessary ribosome binding sites, polyadenylation site, splice donor 
and acceptor sites, transcriptional termination sequences, and 5 f flanking nontranscribed 
sequences. DNA sequences derived from the SV40 splice and polyadenylation sites may be 
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used to provide the required nontranscribed genetic elements. U.S. Patent No. 6,054,267 is 
hereby incorporated by reference in its entirety. 

[000156] Host cells containing the polynucleotides of interest can be cultured in 
conventional nutrient media modified as appropriate for activating promoters, selecting 
5 transformants or amplifying genes. The culture conditions, such as temperature, pH and the 
like, are those previously used with the host cell selected for expression, and will be apparent 
to the ordinarily skilled artisan. Clones, which are identified as having a desired enzyme 
activity or other property may then be sequenced to identify the recombinant polynucleotide 
sequence encoding the enzyme having the desired activity or property. 

10 [000157] In one aspect, the invention provides for the isolated nitrilases as either an isolated 
nucleic acid or an isolated polypeptide wherein the nucleic acid or the polypeptide was 
prepared by recovering DNA from a DNA population derived from at least one uncultivated 
microorganism, and transforming a host with recovered DNA to produce a library of clones 
which is screened for the specified protein, e.g. nitrilase activity. U.S. Patent No. 6,280,926, 

15 Short, provides descriptions of such methods and is hereby incorporated by reference in its 
entirety for all purposes. 

[000158] Therefore, in a one aspect, the invention relates to a method for producing a 
biologically active recombinant nitrilase polypeptide and screening such a polypeptide for 
desired activity or property by: 

20 1) introducing at least a first nitrilase polynucleotide and a second nitrilase 

polynucleotide, said at least first nitrilase polynucleotide and second nitrilase polynucleotide 
sharing at least one region of sequence homology, into a suitable host cell; 

2) growing the host cell under conditions which promote sequence reorganization 
resulting in a recombinant nitrilase polynucleotide; 

25 3) expressing a recombinant nitrilase polypeptide encoded by the recombinant 

nitrilase polynucleotide; 

4) screening the recombinant nitrilase polypeptide for the desired activity or 
property; and 

5) isolating the recombinant nitrilase polynucleotide encoding the recombinant 
30 nitrilase polypeptide. 
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[000159] Examples of vectors which may be used include viral particles, baculovirus, 
phage, plasmids, phagemids, cosmids, fosmids, bacterial artificial chromosomes, viral DNA 
{e.g., vaccinia, adenovirus, fowlpox virus, pseudorabies and derivatives of SV40), PI -based 
artificial chromosomes, yeast plasmids, yeast artificial chromosomes, and any other vectors 
5 specific for the hosts of interest (e.g. , Bacillus, Aspergillus and yeast). Large numbers of 
suitable vectors are known to those of skill in the art, and are commercially available. 
Examples of bacterial vectors include pQE vectors (Qiagen, Valencia, CA); pBluescript 
plasmids, pNH vectors, and lambda-ZAP vectors (Stratagene, La Jolla, CA); and pTRC99a, 
pKK223-3, pDR540, and pRTT2T vectors (Pharmacia, Peapack, NJ). Examples of eukaryotic 
10 vectors include pXTl and pSG5 vectors (Stratagene, La Jolla, CA); and pSVK3, pBPV, 
pMSG, and pSVLSV40 vectors (Pharmacia, Peapack, NJ). However, any other plasmid or 
other vector may be used so long as they are replicable and viable in the host 

[000160] A preferred type of vector for use in the present invention contains an f-factor (or 
fertility factor) origin of replication. The f-factor in E. coli is a plasmid which effects high 
1 5 frequency transfer of itself during conjugation and less frequent transfer of the bacterial 
chromosome itself. A particularly preferred aspect is to use cloning vectors referred to as 
"fosmids" or bacterial artificial chromosome (BAC) vectors. These are derived from E. coli 
f-factor which is able to stably integrate large segments of genomic DNA. 

[000161] The DNA sequence in the expression vector is operably joined to appropriate 
20 expression control sequences, including a promoter, to direct RNA synthesis. Useful 

bacterial promoters include lad, lacZ, T3, T7, gpt, lambda Pr, P l and trp. Useful eukaryotic 
promoters include CMV immediate early, HSV thymidine kinase, early and late SV40, LTRs 
from retrovirus, and mouse metallothionein-I. Selection of the appropriate vector and 
promoter is well within the level of ordinary skill in the art. The expression vector also 
25 contains a ribosome binding site for translation initiation and a transcription terminator. The 
vector may also include appropriate sequences for amplifying expression. Promoter regions 
can be selected from any desired gene using CAT (chloramphenicol transferase) vectors or 
other vectors with selectable markers. 

[000162] In addition, the expression vectors can contain one or more selectable marker 
30 genes to provide a phenotypic trait for selection of transformed host cells. Useful selectable 
markers include dihydrofolate reductase or neomycin resistance for eukaryotic cell culture, or 
tetracycline or ampicillin resistance in E. coli. 
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[000163] The vector may be introduced into the host cells using any of a variety of 
techniques, including transformation, transfection, transduction, viral infection, gene guns, or 
Ti-mediated gene transfer. Particular methods include calcium phosphate transfection, 
DEAE-Dextran mediated transfection, lipofection, or electroporation 

5 [0001 64] Reductive Reassortment - In another aspect, variant nitrilase polynucleotides can 
be generated by the process of reductive reassortment. Whereas recombination is an "inter- 
molecular" process which, in bacteria, is generally viewed as a "recA-dependent" 
phenomenon, the process of "reductive reassortment" occurs by an "intra-molecular", recA- 
independent process. In this aspect, the invention can rely on the ability of cells to mediate 

10 reductive processes to decrease the complexity of quasi-repeated sequences in the cell by 
deletion. The method involves the generation of constructs containing consecutive repeated 
or quasi-repeated sequences (original encoding sequences), the insertion of these sequences 
into an appropriate vector, and the subsequent introduction of the vector into an appropriate 
host cell. The reassortment of the individual molecular identities occurs by combinatorial 

15 processes between the consecutive sequences in the construct possessing regions of 

homology, or between quasi-repeated units. The reassortment process recombines and/or 
reduces the complexity and extent of the repeated sequences, and results in the production of 
novel molecular species. Various treatments may be applied to enhance the rate of 
reassortment, such as ultra-violet light or DNA damaging chemicals. In addition, host cell 

20 lines displaying enhanced levels of "genetic instability" can be used. 

[000165] Repeated Sequences - Repeated or "quasi-repeated" sequences play a role in 
genetic instability. In the present invention, "quasi-repeats" are repeats that are not identical 
in structure but, rather, represent an array of consecutive sequences which have a high degree 
of similarity or identity sequences. The reductive reassortment or deletion process in the cell 
25 reduces the complexity of the resulting construct by deleting sequences between positions 
within quasi-repeated sequences. Because the deletion (and potentially insertion) events can 
occur virtually anywhere within the quasi-repetitive units, these sequences provide a large 
repertoire of potential variants. 

[000166] When the quasi-repeated sequences are all ligated in the same orientation, for 

30 instance head-to-tail or vice versa, the endpoints of a deletion are, for the most part, equally 

likely to occur anywhere within the quasi-repeated sequences. In contrast, when the units are 

presented head-to-head or tail-to-tail, the inverted quasi-repeated sequences can form a 
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duplex which delineates the endpoints of the adjacent units and thereby favors deletion of 
discrete units. Therefore, it is preferable in the present invention that the quasi-repeated 
sequences are joined in the same orientation because random orientation of quasi-repeated 
sequences will result in the loss of reassortment efficiency, while consistent orientation of the 
5 sequences will offer the highest efficiency. Nonetheless, although having fewer of the 
contiguous sequences in the same orientation decreases the efficiency or reductive 
reassortment, it may still provide sufficient variation for the effective recovery of novel 
molecules. 

[000167] Sequences can be assembled in a head-to-tail orientation using any of a variety of 
1 0 methods, including the following: 

a) Primers can be utilized that include a poly-A head and poly-T tail which, when 
made single-stranded, would provide orientation. This is accomplished by having the first 
few bases of the primers made from RNA and hence easily removed by RNAse H. b) 

Primers can be utilized that include unique restriction cleavage sites. Multiple sites, a 
15 battery of unique sequences, and repeated synthesis and ligation steps would be required. 

c) The inner few bases of the primer can be thiolated and an exonuclease used to 
produce properly tailed molecules. 

[000168] The recovery of the reassorted sequences relies on the identification of cloning 
vectors with a reduced repetitive index (RI). The reassorted coding sequences can then be 
20 recovered by amplification. The products are recloned and expressed. The recovery of 
cloning vectors with reduced RI can be effected by: 

1 ) The use of vectors only stably maintained when the construct is reduced in 
complexity. 

2) The physical recovery of shortened vectors by physical procedures. In this 
25 case, the cloning vector would be recovered using standard plasmid isolation procedures and 

then size-fractionated using standard procedures (e.g., agarose gel or column with a low 
molecular weight cut off). 

3) The recovery of vectors containing interrupted genes can be selected when 
insert size decreases. 

30 4) The use of direct selection techniques wherein an expression vector is used 

and the appropriate selection is carried out. 
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[000169] Coding sequences from related organisms may demonstrate a high degree of 
homology but nonetheless encode quite diverse protein products. These types of sequences 
are particularly useful in the present invention as quasi-repeats. However, while the 
examples illustrated below demonstrate the reassortment of coding sequences with a high 
5 degree of identity (quasi-repeats), this process is not limited to nearly identical repeats. 

[000170] The following example demonstrates a method of the invention. Quasi-repetitive 
coding sequences derived from three different species are obtained. Each sequence encodes a 
protein with a distinct set of properties. Each of the sequences differs by one or more base 
pairs at unique positions in the sequences which are designated "A", "B" and "C'\ The 
10 quasi-repeated sequences are separately or collectively amplified and ligated into random 
assemblies such that all possible permutations and combinations are available in the 
population of ligated molecules. The number of quasi-repeat units can be controlled by the 
assembly conditions. The average number of quasi-repeated units in a construct is defined as 
the repetitive index (RI). 

15 [000171] Once formed, the constructs can be size-fractionated on an agarose gel according 
to published protocols, inserted into a cloning vector, and transfected into an appropriate host 
cell. The cells can then be propagated to allow reductive reassortment to occur. The rate of 
the reductive reassortment process may be stimulated by the introduction of DNA damage if 
desired. Whether the reduction in RI is mediated by deletion formation between repeated 

£0 sequences by an "intra-molecular" mechanism, or mediated by recombination-like events 
through "inter-molecular" mechanisms is immaterial. The end result is a reassortment of the 
molecules into all possible combinations. 

[000172] In another aspect, prior to or during recombination or reassortment, 
polynucleotides of the invention or polynucleotides generated by the methods described 

15 herein can be subjected to agents or processes which promote the introduction of mutations 
into the original polynucleotides. The introduction of such mutations would increase the 
diversity of resulting hybrid polynucleotides and polypeptides encoded therefrom. The 
agents or processes which promote mutagenesis include, but are not limited to: (+)-CC-1065, 
or a synthetic analog such as (+)-CC-1065-(N3-adenine) (Sun et al. (1992), Biochemistry 

10 31(10):2822-9); an N-acetylated or deacetylated 4 ! -fluoro-4-aminobiphenyl adduct capable of 

inhibiting DNA synthesis (see, for example, van de Poll et al. (1992), Carcinogenesis 

13(5):751-8); or a N-acetylated or deacetylated 4-aminobiphenyl adduct capable of inhibiting 
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DNA synthesis (see also, Van de Poll et al. (1992), supra); trivalent chromium, a trivalent 
chromium salt, a polycyclic aromatic hydrocarbon ('TAH") DNA adduct capable of 
inhibiting DNA replication, such as 7-bromomethyl-benz[a]anthracene ("BMA"), tris(2,3- 
dibromopropyl)phosphate ("Tris-BP"), l,2-dibromo-3-chloropropane("DBCP"), 2- 
5 bromoacrolein (2B A), benzo[a]pyrene-7,8-dihydrodiol-9- 1 0-epoxide ("BPDE"), a 
platinum(II) halogen salt, N-hydroxy-2-amino-3-methylimidazo[4 J 5-f]-quinoline ("N- 
hydroxy-IQ"), andN-hydroxy-2-amino-l-methyl-6-phenylimidazo[4,5-f]-pyridine ( <C N- 
hydroxy-PhlP"). Especially preferred means for slowing or halting PCR amplification 
consist of UV light (+)-CC-1065 and (+)-CC-1065-(N3-Adenine). Particularly encompassed 
10 means are DNA adducts or polynucleotides comprising the DNA adducts from the 

polynucleotides or polynucleotides pool, which can be released or removed by a process 
including heating the solution comprising the polynucleotides prior to further processing. 

[000173] GSSM™ - The invention also provides for the use of codon primers containing a 
degenerate N,N,G/T sequence to introduce point mutations into a polynucleotide, so as to 

1 5 generate a set of progeny polypeptides in which a full range of single amino acid 

substitutions is represented at each amino acid position, a method referred to as gene site- 
saturated mutagenesis (GSSM™). The oligos used are comprised contiguously of a first 
homologous sequence, a degenerate N,N,G/T sequence, and possibly a second homologous 
sequence. The progeny translational products from the use of such oligos include all possible 

20 amino acid changes at each amino acid site along the polypeptide, because the degeneracy of 
the N,N,G/T sequence includes codons for all 20 amino acids. 

[000174] In one aspect, one such degenerate oligo (comprising one degenerate N,N,G/T 
cassette) is used for subjecting each original codon in a parental polynucleotide template to a 
full range of codon substitutions. In another aspect, at least two degenerate N,N,G/T 

25 cassettes are used - either in the same oligo or not, for subjecting at least two original codons 
in a parental polynucleotide template to a full range of codon substitutions. Thus, more than 
one N,N,G/T sequence can be contained in one oligo to introduce amino acid mutations at 
more than one site. This plurality of N,N,G/T sequences can be directly contiguous, or 
separated by one or more additional nucleotide sequences. In another aspect, oligos 

30 serviceable for introducing additions and deletions can be used either alone or in combination 
with the codons containing an N,N,G/T sequence, to introduce any combination or 
permutation of amino acid additions, deletions, and/or substitutions. 
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[000175] In a particular exemplification, it is possible to simultaneously mutagenize two or 
more contiguous amino acid positions using an oligo that contains contiguous N,N,G/T 
triplets, i.e., a degenerate (N,N,G/T) n sequence. 

[000176] In another aspect, the present invention provides for the use of degenerate 
5 cassettes having less degeneracy than the N,N,G/T sequence. For example, it may be 
desirable in some instances to use a degenerate triplet sequence comprised of only one N, 
where said N can be in the first second or third position of the triplet. Any other bases 
including any combinations and permutations thereof can be used in the remaining two 
positions of the triplet. Alternatively, it may be desirable in some instances to use a 
1 0 degenerate N ,N,N triplet sequence, or an N,N, G/C triplet sequence. 

[000177] It is appreciated, however, that the use of a degenerate triplet (such as N,N,G/T or 
an N,N, G/C triplet sequence) as disclosed in the instant invention is advantageous for several 
reasons. In one aspect, this invention provides a means to systematically and fairly easily 
generate the substitution of the full range of the 20 possible amino acids into each and every 
15 amino acid position in a polypeptide. Thus, for a 100 amino acid polypeptide, the invention 
provides a way to systematically and fairly easily generate 2000 distinct species (i.e., 20 
possible amino acids per position times 100 amino acid positions). It is appreciated that there 
is provided, through the use of an oligo containing a degenerate N,N,G/T or an N,N, G/C 
triplet sequence, 32 individual sequences that code for the 20 possible amino acids. Thus, in 
20 a reaction vessel in which a parental polynucleotide sequence is subjected to saturation 
mutagenesis using one such oligo, there are generated 32 distinct progeny polynucleotides 
encoding 20 distinct polypeptides. In contrast, the use of a non-degenerate oligo in site- 
directed mutagenesis leads to only one progeny polypeptide product per reaction vessel. 

[000178] This invention also provides for the use of nondegenerate oligonucleotides, which 
can optionally be used in combination with degenerate primers disclosed. It is appreciated 
that in some situations, it is advantageous to use nondegenerate oligos to generate specific 
point mutations in a working polynucleotide. This provides a means to generate specific 
silent point mutations, point mutations leading to corresponding amino acid changes, and 
point mutations that cause the generation of stop codons and the corresponding expression of 
polypeptide fragments. 



25 
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[000179] Thus, in one aspect, each saturation mutagenesis reaction vessel contains 
polynucleotides encoding at least 20 progeny polypeptide molecules such that all 20 amino 
acids are represented at the one specific amino acid position corresponding to the codon 
position mutagenized in the parental polynucleotide. The 32-fold degenerate progeny 
polypeptides generated from each saturation mutagenesis reaction vessel can be subjected to 
clonal amplification (e.g., cloned into a suitable E. coli host using an expression vector) and 
subjected to expression screening. When an individual progeny polypeptide is identified by 
screening to display a favorable change in property (when compared to the parental 
polypeptide), it can be sequenced to identify the correspondingly favorable amino acid 
substitution contained therein. 

[000180] It is appreciated that upon mutagenizing each and every amino acid position in a 
parental polypeptide using saturation mutagenesis as disclosed herein, favorable amino acid 
changes may be identified at more than one amino acid position. One or more new progeny 
molecules can be generated that contain a combination of all or part of these favorable amino 
15 acid substitutions. For example, if 2 specific favorable amino acid changes are identified in 
each of 3 amino acid positions in a polypeptide, the permutations include 3 possibilities at 
each position (no change from the original amino acid, and each of two favorable changes) 
and 3 positions. Thus, there are 3 x 3 x 3 or 27 total possibilities, including 7 that were 
previously examined - 6 single point mutations (z.e., 2 at each of three positions) and no 
20 change at any position. 

[000181] In yet another aspect, site-saturation mutagenesis can be used together with 
shuffling, chimerization, recombination and other mutagenizing processes, along with 
screening. This invention provides for the use of any mutagenizing process(es), including 
saturation mutagenesis, in an iterative manner. In one exemplification, the iterative use of 
25 any mutagenizing processes) is used in combination with screening. 

[000182] Thus, in a non-limiting exemplification, polynucleotides and polypeptides of the 
invention can be derived by saturation mutagenesis in combination with additional 
mutagenization processes, such as process where two or more related polynucleotides are 
introduced into a suitable host cell such that a hybrid polynucleotide is generated by 
30 recombination and reductive reassortment. 
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[000183] In addition to performing mutagenesis along the entire sequence of a gene, 
mutagenesis can be used to replace each of any number of bases in a polynucleotide 
sequence, wherein the number of bases to be mutagenized can be each integer from about 15 
to about 100,000. Thus, instead of mutagenizing every position along a molecule, one can 

5 subject every or a discrete number of bases {e.g., a subset totaling from about 15 to about 
100,000) to mutagenesis. In one aspect, a separate nucleotide is used for mutagenizing each 
position or group of positions along a polynucleotide sequence. A group of 3 positions to be 
mutagenized can be a codon. In one aspect, the mutations are introduced using a mutagenic 
primer, containing a heterologous cassette, also referred to as a mutagenic cassette. For 

10 example, cassettes can have from about 1 to about 500 bases. Each nucleotide position in 
such heterologous cassettes can be N, A, C, G, T, A/C, A/G, A/T, C/G, C/T, G/T, C/G/T, 
A/G/T, A/C/T, A/C/G, or E, where E is any base that is not A, C, G, or T. 

[000184] In a general sense, saturation mutagenesis comprises mutagenizing a complete set 
of mutagenic cassettes (for example, each cassette is about 1-500 bases in length) in a defined 

15 polynucleotide sequence to be mutagenized (for example, the sequence to be mutagenized is 
from about 15 to about 100,000 bases in length). Thus, a group of mutations (ranging from 
about 1 to about 100 mutations) is introduced into each cassette to be mutagenized. A 
grouping of mutations to be introduced into one cassette can be different or the same from a 
second grouping of mutations to be introduced into a second cassette during the application 

20 of one round of saturation mutagenesis. Such groupings are exemplified by deletions, 

additions, groupings of particular codons, and groupings of particular nucleotide cassettes. 

[000185] Defined sequences to be mutagenized include a whole gene, pathway, cDNA, 
entire open reading frame (ORF), promoter, enhancer, repressor/transactivator, origin of 
replication, intron, operator, or any polynucleotide functional group. Generally, a "defined 
25 sequence" for this purpose may be any polynucleotide that a 1 5 base-polynucleotide 

sequence, and polynucleotide sequences of lengths between about 15 bases and about 15,000 
bases (this invention specifically names every integer in between). Considerations in 
choosing groupings of codons include types of amino acids encoded by a degenerate 
mutagenic cassette. 

?0 [000186] In a particularly preferred exemplification a grouping of mutations that can be 

introduced into a mutagenic cassette, this invention specifically provides for degenerate 

codon substitutions (using degenerate oligos) that code for 2, 3, 4, 5, 6, 7, 8, 9, 10, 1 1, 12, 13, 
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14, 15, 16, 17, 18, 19, and 20 amino acids at each position, and a library of polypeptides 
encoded thereby. 



[000187] One aspect of the invention is an isolated nucleic acid comprising one of the 
sequences of the Group A nucleic acid sequences, sequences substantially identical thereto, 

5 sequences complementary thereto, or a fragment comprising at least 10, 15, 20, 25, 30, 35, 
40, 50, 75, 100, 150, 200, 300, 400, or 500 consecutive bases of one of the sequences of the 
Group A nucleic acid sequences. The isolated nucleic acids may comprise DNA, including 
cDNA, genomic DNA, and synthetic DNA. The DNA may be double-stranded or single- 
stranded, and if single stranded may be the coding strand or non-coding (anti-sense) strand. 

1 0 Alternatively, the isolated nucleic acids may comprise RNA. 

[000188] As discussed in more detail below, the isolated nucleic acid sequences of the 
invention may be used to prepare one of the polypeptides of the Group B amino acid 
sequences, and sequences substantially identical thereto, or fragments comprising at least 5, 
10, 15, 20, 25, 30, 35, 40, 50, 75, 100, or 150 consecutive amino acids of one of the 
15 polypeptides of the Group B amino acid sequences, and sequences substantially identical 
thereto. 

[000189] Alternatively, the nucleic acid sequences of the invention may be mutagenized 
using conventional techniques, such as site directed mutagenesis, or other techniques familiar 
to those skilled in the art, to introduce silent changes into the polynucleotides of the Group A 
20 nucleic acid sequences, and sequences substantially identical thereto. As used herein, "silent 
changes" include, for example, changes which do not alter the amino acid sequence encoded 
by the polynucleotide. Such changes may be desirable in order to increase the level of the 
polypeptide produced by host cells containing a vector encoding the polypeptide by 
introducing codons or codon pairs which occur frequently in the host organism. 

25 [000190] The invention also relates to polynucleotides which have nucleotide changes 
which result in amino acid substitutions, additions, deletions, fusions and truncations in the 
polypeptides of the invention (e.g., the Group B amino acid sequences). Such nucleotide 
changes may be introduced using techniques such as site-directed mutagenesis, random 
chemical mutagenesis, exonuclease HI deletion, and other recombinant DNA techniques. 

30 Alternatively, such nucleotide changes may be naturally occurring allelic variants which are 
isolated by identifying nucleic acid sequences which specifically hybridize to probes 
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comprising at least 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 150, 200, 300, 400, or 500 
consecutive bases of one of the sequences of the Group A nucleic acid sequences, and 
sequences substantially identical thereto (or the sequences complementary thereto) under 
conditions of high, moderate, or low stringency as provided herein. 

5 Immobilized Enzyme Solid Supports 

[000191] The enzymes, fragments thereof and nucleic acids which encode the enzymes and 
fragments can be affixed to a solid support. This is often economical and efficient in the use 
of the enzymes in industrial processes. For example, a consortium or cocktail of enzymes (or 
active fragments thereof), which are used in a specific chemical reaction, can be attached to a 

10 solid support and dunked into a process vat. The enzymatic reaction can occur. Then, the 
solid support can be taken out of the vat, along with the enzymes affixed thereto, for repeated 
use. In one aspect of the invention, the isolated nucleic acid is affixed to a solid support. In 
another aspect of the invention, the solid support is selected from the group of a gel, a resin, a 
polymer, a ceramic, a glass, a microelectrode and any combination thereof. 

15 [000192] For example, solid supports useful in this invention include gels. Some examples 
of gels include sepharose, gelatin, glutaraldehyde, chitosan-treated glutaraldehyde, albumin- 
glutaraldehyde, chitosan-Xanthan, toyopearl gel (polymer gel), alginate, alginate-polylysine, 
carrageenan, agarose, glyoxyl agarose, magnetic agarose, dextran-agarose, poly(Carbamoyl 
Sulfonate) hydrogel, BSA-PEG hydrogel, phosphorylated polyvinyl alcohol (PVA), 

10 monoaminoethyl-N-aminoethyl (MANA), amino, or any combination thereof. 

[000193] Another solid support useful in the present invention are resins or polymers. 
Some examples of resins or polymers include cellulose, acrylamide, nylon, rayon, polyester, 
anion-exchange resin, AMBERLITE™ XAD-7, AMBERLITE™ XAD-8, AMBERLITE™ 
IRA-94, AMBERLITE™ IRC-50, polyvinyl, polyacrylic, polymethacrylate, or any 

15 combination thereof. Another type of solid support useful in the present invention is ceramic. 
Some examples include non-porous ceramic, porous ceramic, Si(>2, AI2O3. Another type of 
solid support useful in the present invention is glass. Some examples include non-porous 
glass, porus glass, aminopropyl glass or any combination thereof Another type of solid 
support which can be used is a mcroelectrode. An example is a polyethyleneimine-coated 

JO magnetite. Graphitic particles can be used as a solid support. Another example of a solid 
support is a cell, such as a red blood cell. 
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Methods of immobilization 

[000194] There are many methods which would be known to one of skill in the art for 
immobilizing enzymes or fragments thereof, or nucleic acids, onto a solid support. Some 
examples of such methods include electrostatic droplet generation, electrochemical means, 

5 via adsorption, via covalent binding, via cross-linking, via a chemical reaction or process, via 
encapsulation, via entrapment, via calcium alginate, or via poly (2-hydroxyethyl 
methacrylate). Like methods are described in Methods in Enzymology, Immobilized Enzymes 
aw/ Ce/k, Part C. 1987. Academic Press. Edited by S. P. Colowick and N. O. Kaplan. 
Volume 136; and Immobilization of Enzymes and Cells. 1997. Humana Press. Edited by G. 

10 F. Bickerstaff. Series: Methods in Biotechnology, Edited by J. M. Walker. 

[000195] Probes - The isolated nucleic acids of the Group A nucleic acid sequences, 
sequences substantially identical thereto, complementary sequences, or a fragment 
comprising at least 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 150, 200, 300, 400, or 500 
consecutive bases of one of the foregoing sequences may also be used as probes to determine 

15 whether a biological sample, such as a soil sample, contains an organism having a nucleic 
acid sequence of the invention or an organism from which the nucleic acid was obtained. In 
such procedures, a biological sample potentially harboring the organism from which the 
nucleic acid was isolated is obtained and nucleic acids are obtained from the sample. The 
nucleic acids are contacted with the probe under conditions which permit the probe to 

10 specifically hybridize to any complementary sequences which are present therein. 

[000196] Where necessary, conditions which permit the probe to specifically hybridize to 
complementary sequences may be determined by placing the probe in contact with 
complementary sequences from samples known to contain the complementary sequence as 
well as control sequences which do not contain the complementary sequence. Hybridization 

15 conditions, such as the salt concentration of the hybridization buffer, the formamide 

concentration of the hybridization buffer, or the hybridization temperature, can be varied to 
identify conditions which allow the probe to hybridize specifically to complementary nucleic 
acids. Stringent hybridization conditions are recited herein. 

[000197] Hybridization may be detected by labeling the probe with a detectable agent such 
50 as a radioactive isotope, a fluorescent dye or an enzyme capable of catalyzing the formation 
of a detectable product. Many methods for using the labeled probes to detect the presence of 
complementary nucleic acids in a sample are familiar to those skilled in the art. These 
include Southern Blots, Northern Blots, colony hybridization procedures, and dot blots. 
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Protocols for each of these procedures are provided in Ausubel et al. (1997), Current 
Protocols in Molecular Biology, John Wiley & Sons, Inc., and Sambrook et al. (1989), 
Molecular Cloning: A Laboratory Manual 2d Ed., Cold Spring Harbor Laboratory Press, the 
entire disclosures of which are incorporated herein by reference. 
5 [000198] In one example, a probe DNA is "labeled" with one partner of a specific binding 
pair (z.e., a ligand) and the other partner of the pair is bound to a solid matrix to provide ease 
of separation of target from its source. For example, the ligand and specific binding partner 
can be selected from, in either orientation, the following: (1) an antigen or hapten and an 
antibody or specific binding fragment thereof; (2) biotin or iminobiotin and avidin or 

10 streptavidin; (3) a sugar and a lectin specific therefor; (4) an enzyme and an inhibitor 
therefor; (5) an apoenzyme and cofactor; (6) complementary homopolymeric 
oligonucleotides; and (7) a hormone and a receptor therefor. In one example, the solid phase 
is selected from: (1) a glass or polymeric surface; (2) a packed column of polymeric beads; 
and (3) magnetic or paramagnetic particles. 

1 5 [000199] Alternatively, more than one probe (at least one of which is capable of specifically 
hybridizing to any complementary sequences which are present in the nucleic acid sample), 
may be used in an amplification reaction to determine whether the sample contains an 
organism containing a nucleic acid sequence of the invention {e.g., an organism from which 
the nucleic acid was isolated). Typically, the probes comprise oligonucleotides. In one 

20 aspect, the amplification reaction may comprise a PGR reaction. PCR protocols are 

described in Ausubel et al. (1997), supra, and Sambrook et al. (1989), supra. Alternatively, 
the amplification may comprise a ligase chain reaction, 3SR, or strand displacement reaction. 
(See Barany (1991), PCR Methods and Applications 1:5-16; Fahy et al. (1991), PCR Methods 
and Applications 1:25-33; and Walker et al. (1992), Nucleic Acid Research 20:1691-1696, 

25 the disclosures of which are incorporated herein by reference in their entireties). 

[000200] Probes derived from sequences near the ends of a sequence as set forth in the 
Group A nucleic acid sequences, and sequences substantially identical thereto, may also be 
used in chromosome walking procedures to identify clones containing genomic sequences 
located adjacent to the nucleic acid sequences as set forth above. Such methods allow the 

30 isolation of genes which encode additional proteins from the host organism. 

[000201] An isolated nucleic acid sequence as set forth in the Group A nucleic acid 
sequences, sequences substantially identical thereto, sequences complementary thereto, or a 
fragment comprising at least 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 150, 200, 300, 400, or 
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500 consecutive bases of one of the foregoing sequences may be used as probes to identify 
and isolate related nucleic acids. In some aspects, the related nucleic acids may be cDNAs or 
genomic DNAs from organisms other than the one from which the nucleic acid was isolated. 
For example, the other organisms may be related organisms. In such procedures, a nucleic 
5 acid sample is contacted with the probe under conditions which permit the probe to 

specifically hybridize to related sequences. Hybridization of the probe to nucleic acids from 
the related organism is then detected using any of the methods described above. 
[000202] In nucleic acid hybridization reactions, the conditions used to achieve a particular 
level of stringency will vary, depending on the nature of the nucleic acids being hybridized. 

1 0 For example, the length of the nucleic acids, the amount of complementarity between the 
nucleic acids, the nucleotide sequence composition (e.g., G-C rich v. A-T rich content), and 
the nucleic acid type (e.g., RNA v. DNA) can be considered in selecting hybridization 
conditions. Stringency may be varied by conducting the hybridization at varying 
temperatures below the melting temperatures of the probes. The melting temperature, Tm, is 

1 5 the temperature (under defined ionic strength and pH) at which 50% of the target sequence 
hybridizes to a perfectly complementary probe. Stringent conditions are selected to be equal 
to or about 5°C lower than the Tm for a particular probe. The melting temperature of the 
probe may be calculated using the following formulas: 

[000203] For probes between 1 4 and 70 nucleotides in length the melting temperature (Tm) 
20 is calculated using the formula: Tm=81 .5+16.6(log [Na + ])+0.41 (fraction G+Q-(600/N) 
where N is the length of the probe. 

[000204] If the hybridization is carried out in a solution containing formamide, the melting 
temperature may be calculated using the equation: Tm=81.5+16.6(log [Na^yHJ.41 (fraction 
G+C)-(0.63% formamide>(600/N) where N is the length of the probe. 

25 [000205] Expression Libraries - Expression libraries can be created using the 

polynucleotides of the invention in combination with expression vectors and appropriate host 
cells. The library allows for the in vivo expression of the polypeptides which are encoded by 
the polynucleotides of the invention. After such expression libraries have been generated one 
can include the additional step of "biopanning" such libraries prior to screening by cell 

30 sorting. The "biopanning" procedure refers to a process for identifying clones having a 

specified biological activity by screening for sequence identity in a library of clones prepared 
by (i) selectively isolating target DNA derived from at least one microorganism by use of at 
least one probe DNA comprising at least a portion of a DNA sequence encoding a 
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polypeptide having a specified biological activity (e.g., nitrilase activity); and (ii) optionally 
transforming a host with the isolated target DNA to produce a library of clones which are 
screened for the specified biological activity. 

[000206] The probe DNA used for selectively isolating the target DNA of interest from the 
5 DNA derived from at least one microorganism can be a full-length coding region sequence or 
a partial coding region sequence of DNA for an enzyme of known activity. The original 
DNA library can be probed using mixtures of probes comprising at least a portion of DNA 
sequences encoding enzymes having the specified enzyme activity. These probes or probe 
libraries are single-stranded and the microbial DNA which is probed has been converted into 
10 single-stranded form. The probes that are particularly suitable are those derived from DNA 
encoding enzymes having an activity similar or identical to the specified enzyme activity that 
is to be screened. 

[000207] Having prepared a multiplicity of clones from DNA selectively isolated from an 
organism, such clones are screened for a specific enzyme activity and to identify the clones 

1 5 having the specified enzyme characteristics. 

[000208] The screening for enzyme activity may be affected on individual expression 
clones or may be initially affected on a mixture of expression clones to ascertain whether or 
not the mixture has one or more specified enzyme activities. If the mixture has a specified 
enzyme activity, then the individual clones may be rescreened for such enzyme activity or for 

20 a more specific activity. Thus, for example, if a clone mixture has nitrilase activity, then the 
individual clones may be recovered and screened to determine which of such clones has 
nitrilase activity. 

[000209] As described with respect to one of the above aspects, the invention provides a 
process for enzyme activity screening of clones containing selected DNA derived from a 

25 microorganism which process includes: screening a library for specified enzyme activity, 
said library including a plurality of clones, said clones having been prepared by recovering 
from genomic DNA of a microorganism selected DNA, which DNA is selected by 
hybridization to at least one DNA sequence which is all or a portion of a DNA sequence 
encoding an enzyme having the specified activity; and transforming a host with the selected 

JO DNA to produce clones which are screened for the specified enzyme activity. 

[000210] In one aspect, a DNA library derived from a microorganism is subjected to a 
selection procedure to select therefrom DNA which hybridizes to one or more probe DNA 
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sequences which is all or a portion of a DNA sequence encoding an enzyme having the 
specified enzyme activity by: 

(a) contacting the single-stranded DNA population from the DNA library with the 
DNA probe bound to a ligand under stringent hybridization conditions so as to produce a 
duplex between the probe and a member of the DNA library; 

(b) contacting the duplex with a solid phase specific binding partner for the ligand so 
as to produce a solid phase complex; 

(c) separating the solid phase complex from the non-duplexed members of the DNA 

library; 

(d) denaturing the duplex to release the member of the DNA library; 

(e) creating a complementary DNA strand of the member from step (d) so as to make the 
member a double-stranded DNA; 

(f) introducing the double-stranded DNA into a suitable host so as to express a 
polypeptide which is encoded by the member DNA; and 

(g) determining whether the polypeptide expressed exhibits the specified enzymatic 
activity. 

[00021 1] In another aspect, the process includes a preselection to recover DNA including 
signal or secretion sequences. In this manner it is possible to select from the genomic DNA 
population by hybridization as hereinabove described only DNA which includes a signal or 
secretion sequence. The following paragraphs describe the protocol for this aspect of the 
invention, the nature and function of secretion signal sequences in general and a specific 
exemplary application of such sequences to an assay or selection process. 
[000212] A particularly aspect of this aspect further comprises, after (a) but before (b) 
above, the steps of: 

(i) contacting the single-stranded DNA population of (a) with a ligand-bound 
oligonucleotide probe that is complementary to a secretion signal sequence unique to a given 
class of proteins under hybridization conditions to form a double-stranded DNA duplex; 

(ii) contacting the duplex of (i) with a solid phase specific binding partner for said 
ligand so as to produce a solid phase complex; 

(iii) separating the solid phase complex from the single-stranded DNA population of 

(a); 

(iv) denaturing the duplex so as to release single-stranded DNA members of the 
genomic population; and 
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(v) separating the single-stranded DNA members from the solid phase bound probe. 
[000213] The DNA which has been selected and isolated to include a signal sequence is 
then subjected to the selection procedure hereinabove described to select and isolate 
therefrom DNA which binds to one or more probe DNA sequences derived from DNA 
5 encoding an enzyme(s) having the specified enzyme activity. This procedure is described 
and exemplified in U.S. Pat. No. 6,054,267, incorporated herein by reference in its entirety. 
[000214] In vivo biopanning may be performed utilizing a (fluorescence activated cell 
sorter) FACS-based machine. Complex gene libraries are constructed with vectors which 
contain elements which stabilize transcribed RNA. For example, the inclusion of sequences 

1 0 which result in secondary structures such as hairpins which are designed to flank the 

transcribed regions of the RNA would serve to enhance their stability, thus increasing then- 
half life within the cell. The probe molecules used in the biopanning process consist of 
oligonucleotides labeled with reporter molecules that only fluoresce upon binding of the 
probe to a target molecule. These probes are introduced into the recombinant cells from the 

1 5 library using one of several transformation methods. The probe molecules bind to the 

transcribed target mRNA resulting in DNA/RNA heteroduplex molecules. Binding of the 
probe to a target will yield a fluorescent signal that is detected and sorted by the FACS 
machine during the screening process. 

[000215] In some aspects, the nucleic acid encoding one of the polypeptides of the Group B 
20 amino acid sequences, sequences substantially identical thereto, or fragments comprising at 
least about 5, 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, or 150 consecutive amino acids thereof 
is assembled in appropriate phase with a leader sequence capable of directing secretion of the 
translated polypeptide or fragment thereof. Optionally, the nucleic acid can encode a fusion 
polypeptide in which one of the polypeptides of the Group B amino acid sequences, 
25 sequences substantially identical thereto, or fragments comprising at least 5, 10, 15, 20, 25, 
30, 35, 40, 50, 75, 100, or 150 consecutive amino acids thereof is fused to heterologous 
peptides or polypeptides, such as N-terminal identification peptides which impart desired 
characteristics, such as increased stability or simplified purification. 
[000216] The host cell may be any of the host cells familiar to those skilled in the art, 
30 including prokaryotic cells, eukaryotic cells, mammalian cells, insect cells, or plant cells. As 
representative examples of appropriate hosts, there may be mentioned: bacterial cells, such as 
E. coli, Streptomyces, Bacillus subtilis, Salmonella typhimurium and various species within 
the genera Pseudomonas, Streptomyces, and Staphylococcus, fungal cells, such as yeast, 
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insect cells such as Drosophila S2 and Spodoptera Sf9, animal cells such as CHO, COS or 
Bowes melanoma, and adenoviruses. The selection of an appropriate host is within the 
abilities of those skilled in the art. 

[000217] Where appropriate, the engineered host cells can be cultured in conventional 
5 nutrient media modified as appropriate for activating promoters, selecting transformants or 
amplifying the genes of the invention. Following transformation of a suitable host strain and 
growth of the host strain to an appropriate cell density, the selected promoter may be induced 
by appropriate means (e.g., temperature shift or chemical induction) and the cells may be 
cultured for an additional period to allow them to produce the desired polypeptide or 
1 0 fragment thereof. 

[000218] Cells are typically harvested by centrifugation, disrupted by physical or chemical 
means, and the resulting crude extract is retained for further purification. Microbial cells 
employed for expression of proteins can be disrupted by any convenient method, including 
freeze-thaw cycling, sonication, mechanical disruption, or use of cell lysing agents. Such 

1 5 methods are well known to those skilled in the art. The expressed polypeptide or fragment 
thereof can be recovered and purified from recombinant cell cultures by methods including 
ammonium sulfate or ethanol precipitation, acid extraction, anion or cation exchange 
chromatography, phosphocellulose chromatography, hydrophobic interaction 
chromatography, affinity chromatography, hydroxylapatite chromatography and lectin 

20 chromatography. Protein refolding steps can be used, as necessary, in completing 

configuration of the polypeptide. If desired, high performance liquid chromatography 
(HPLC) can be employed for final purification steps. 

[000219] Various mammalian cell culture systems can also be employed to express 
recombinant protein. Examples of mammalian expression systems include the COS-7 lines 
25 of monkey kidney fibroblasts (described by Gluzman (1 98 1), Cell 23: 1 75,), and other cell 
lines capable of expressing proteins from a compatible vector, such as the C127, 3T3, CHO, 
HeLa and BHK cell lines. 

[000220] The invention also relates to variants of the polypeptides of the Group B amino 
acid sequences, sequences substantially identical thereto, or fragments comprising at least 5, 
30 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, or 150 consecutive amino acids thereof. In particular, 
the variants may differ in amino acid sequence from the polypeptides of the Group B amino 
acid sequences, and sequences substantially identical thereto, by one or more substitutions, 
additions, deletions, fusions and truncations, which may be present in any combination. 
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[000221] the variants may be naturally occurring or created in vitro. In particular, such 
variants may be created using genetic engineering techniques such as site directed 
mutagenesis, random chemical mutagenesis, Exonuclease III deletion procedures, and 
standard cloning techniques. Alternatively, such variants, fragments, analogs, or derivatives 

5 may be created using chemical synthesis or modification procedures. 

[000222] Other methods of making variants are also familiar to those skilled in the art. 
These include procedures in which nucleic acid sequences obtained from natural isolates are 
modified to generate nucleic acids which encode polypeptides having characteristics which 
enhance their value in industrial or laboratory applications. In such procedures, a large 

10 number of variant sequences having one or more nucleotide differences with respect to the 
sequence obtained from the natural isolate are generated and characterized. Typically, these 
nucleotide differences result in amino acid changes with respect to the polypeptides encoded 
by the nucleic acids from the natural isolates. 
Error Prone PCR 

1 5 [000223] For example, variants may be created using error prone PCR. In error prone PCR, 
PCR is performed under conditions where the copying fidelity of the DNA polymerase is 
low, such that a high rate of point mutations is obtained along the entire length of the PCR 
product. Error prone PCR is described in Leung et al. (1989), Technique 1:11-15 and 
Caldwell et al. (1992), PCR Methods Applic. 2:28-33, the disclosures of which are 

20 incorporated herein by reference in their entirety. Briefly, in such procedures, nucleic acids 
to be mutagenized are mixed with PCR primers and reagents (e.g., reaction buffer, MgCl 2 , 
MnCl2, Taq polymerase and an appropriate concentration of dNTPs) for achieving a high rate 
of point mutation along the entire length of the PCR product. For example, the reaction may 
be performed using 20 finoles of nucleic acid to be mutagenized, 30 pmoles of each PCR 

25 primer, a reaction buffer comprising 50mM KC1, lOmM Tris HC1 (pH 8.3) and 0.01% 
gelatin, 7mM MgCl 2 , 0.5mM MnCl 2 , 5 units of Taq polymerase, 0.2mM dGTP, 0.2mM 
dATP, ImM dCTP, and ImM dTTP. PCR maybe performed for 30 cycles of 94°C for 1 
min, 45°C for 1 min, and 72°C for 1 min. However, it will be appreciated that these 
parameters may be varied as appropriate. The mutagenized nucleic acids are cloned into an 

30 appropriate vector and the activities of the polypeptides encoded by the mutagenized nucleic 
acids are evaluated. 

[000224] Variants also may be created using oligonucleotide directed mutagenesis to 
generate site-specific mutations in any cloned DNA of interest. Oligonucleotide mutagenesis 
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is described in Reidhaar-Olson et al. (1988), Science, 241:53-57, the disclosure of which is 
incorporated herein by reference in its entirety. Briefly, in such procedures a plurality of 
double stranded oligonucleotides bearing one or more mutations to be introduced into the 
cloned DNA are synthesized and inserted into the cloned DNA to be mutagenized. Clones 
5 containing the mutagenized DNA are recovered and the activities of the polypeptides they 
encode are assessed. 
Assembly PCR 

[000225] Another method for generating variants is assembly PCR. Assembly PCR 
involves the assembly of a PCR product from a mixture of small DNA fragments. A large 
1 0 number of different PCR reactions occur in parallel in the same vial, with the products of one 
reaction priming the products of another reaction. Assembly PCR is described in U.S. Pat. 
No. 5,965,408, the disclosure of which is incorporated herein by reference in its entirety. 
Sexual PCR mutagenesis 

[000226] Still another method of generating variants is sexual PCR mutagenesis. In sexual 

1 5 PCR mutagenesis, forced homologous recombination occurs between DNA molecules of 

different but highly related DNA sequence in vitro, as a result of random fragmentation of the 
DNA molecule based on sequence homology, followed by fixation of the crossover by primer 
extension in a PCR reaction. Sexual PCR mutagenesis is described in Stemmer (1994), Proc. 
Natl Acad. Sci. USA 91:10747-10751, the disclosure of which is incorporated herein by 

20 reference in its entirety. Briefly, in such procedures a plurality of nucleic acids to be 

recombined are digested with DNAse to generate fragments having an average size of about 
50-200 nucleotides. Fragments of the desired average size are purified and resuspended in a 
PCR mixture. PCR is conducted under conditions which facilitate recombination between 
the nucleic acid fragments. For example, PCR may be performed by resuspending the 

25 purified fragments at a concentration of 10-30 ng/fil in a solution of 0.2mM of each dNTP, 
2.2mM MgCl 2 , 50mM KC1, lOmM Tris HC1, pH 9.0, and 0.1% Triton X-100. 2.5 units of 
Taq polymerase per 100|il of reaction mixture is added and PCR is performed using the 
following regime: 94°C for 60 seconds, 94°C for 30 seconds, 50-55°C for 30 seconds, 72°C 
for 30 seconds (30-45 times) and 72°C for 5 minutes. However, it will be appreciated that 

30 these parameters may be varied as appropriate. In some aspects, oligonucleotides may be 
included in the PCR reactions. In other aspects, the Klenow fragment of DNA polymerase I 
may be used in a first set of PCR reactions and Taq polymerase may be used in a subsequent 
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set of PCR reactions. Recombinant sequences are isolated and the activities of the 
polypeptides they encode are assessed. 
In vivo Mutagenesis 

[000227] Variants may also be created by in vivo mutagenesis. In some aspects, random 
5 mutations in a sequence of interest are generated by propagating the sequence of interest in a 
bacterial strain, such as an E. coli strain, which carries mutations in one or more of the DNA 
repair pathways. Such "mutator" strains have a higher random mutation rate than that of a 
wild-type parent Propagating the DNA in one of these strains will eventually generate 
random mutations within the DNA. Mutator strains suitable for use for in vivo mutagenesis 
10 are described in PCT Publication No. WO 91/16427 the disclosure of which is incorporated 
herein by reference in its entirety. 
Cassette Mutagenesis 

[000228] Variants may also be generated using cassette mutagenesis. In cassette 
mutagenesis a small region of a double stranded DNA molecule is replaced with a synthetic 
15 oligonucleotide "cassette" that differs from the native sequence. The oligonucleotide often 
contains completely and/or partially randomized native sequence. 
Recursive Ensemble Mutagenesis 

[000229] Recursive ensemble mutagenesis may also be used to generate variants. 
Recursive ensemble mutagenesis is an algorithm for protein engineering (protein 

20 mutagenesis) developed to produce diverse populations of phenotypically related mutants 
whose members differ in amino acid sequence. This method uses a feedback mechanism to 
control successive rounds of combinatorial cassette mutagenesis. Recursive ensemble 
mutagenesis is described in Arkin et al. (1992), Proa Natl Acad Set USA, 89:7811-7815, 
the disclosure of which is incorporated herein by reference in its entirety. 

25 Exponential Ensemble Mutagenesis 

[000230] In some aspects, variants are created using exponential ensemble mutagenesis. 
Exponential ensemble mutagenesis is a process for generating combinatorial libraries with a 
high percentage of unique and functional mutants, wherein small groups of residues are 
randomized in parallel to identify, at each altered position, amino acids which lead to 

30 functional proteins. Exponential ensemble mutagenesis is described in Delegrave et al. 

(1993), Biotechnology Research 11:1548-1552, the disclosure of which incorporated herein 
by reference in its entirety. 
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Random and site-directed mutagenesis 

[000231] Random and site-directed mutagenesis is described in Arnold (1993), Current 
Opinions in Biotechnology 4:450-455, the disclosure of which is incorporated herein by 
reference in its entirety. 

5 Shuffling Procedures 

[000232J In some aspects, the variants are created using shuffling procedures wherein 
portions of a plurality of nucleic acids which encode distinct polypeptides are fused together 
to create chimeric nucleic acid sequences which encode chimeric polypeptides as described in 
U.S. Patent. Nos. 5,965,408 and 5,939,250, each of which is hereby incorporated by reference 

10 in their entireties. 

[000233] The variants of the polypeptides of the Group B amino acid sequences may be 
variants in which one or more of the amino acid residues of the polypeptides of the Group B 
amino acid sequences are substituted with a conserved or non-conserved amino acid residue 
(e.g-, a conserved amino acid residue) and such substituted amino acid residue may or may 

1 5 not be one encoded by the genetic code. 

[000234] Conservative substitutions are those that substitute a given amino acid in a 
polypeptide by another amino acid of like characteristics. Typically seen as conservative 
substitutions are the following replacements: replacements of an aliphatic amino acid such as 
Alanine, Valine, Leucine and Isoleucine with another aliphatic amino acid; replacement of a 

20 Serine with a Threonine or vice versa; replacement of an acidic residue such as Aspartic acid 
and Glutamic acid with another acidic residue; replacement of a residue bearing an amide 
group, such as Asparagine and Glutamine, with another residue bearing an amide group; 
exchange of a basic residue such as Lysine and Arginine with another basic residue; and 
replacement of an aromatic residue such as Phenylalanine, Tyrosine with another aromatic 

25 residue. 

[000235] Other variants are those in which one or more of the amino acid residues of the 
polypeptides of the Group B amino acid sequences includes a substituent group. 
[000236] Still other variants are those in which the polypeptide is associated with another 
compound, such as a compound to increase the half-life of the polypeptide (for example, 
30 polyethylene glycol). 

[000237] Additional variants are those in which additional amino acids are fused to the 
polypeptide, such as a leader sequence, a secretory sequence, a proprotein sequence or a 
sequence which facilitates purification, enrichment, or stabilization of the polypeptide. 
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[000238] In some aspects, the fragments, derivatives and analogs retain the same biological 
function or activity as the polypeptides of the Group B amino acid sequences, and sequences 
substantially identical thereto. In other aspects, the fragment, derivative, or analog includes a 
proprotein, such that the fragment, derivative, or analog can be activated by cleavage of the 
5 proprotein portion to produce an active polypeptide. 

[000239] Another aspect of the invention is polypeptides or fragments thereof which have at 
least about 85%, at least about 90%, at least about 95%, or more than about 95% homology to 
one of the polypeptides of the Group B amino acid sequences, sequences substantially 
identical thereto, or a fragment comprising at least 5, 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 

10 or 150 consecutive amino acids thereof. Percent identity may be determined using any of the 
programs described above which aligns the polypeptides or fragments being compared and 
determines the extent of amino acid homology or similarity between them. It will be 
appreciated that amino acid "homology" includes conservative amino acid substitutions such 
as those described above. In one aspect of the invention, the fragments can be used to 

15 generate antibodies. These antibodies can be used to immobilize nitrilases can be used in 
industrial processes. Polynucleotides encoding the nitrilases of the present invention can be 
used in a similar way. 

[000240] Alternatively, the homologous polypeptides or fragments may be obtained through 
biochemical enrichment or purification procedures. The sequence of potentially homologous 

20 polypeptides or fragments may be determined by proteolytic digestion, gel electrophoresis 
and/or microsequencing. The sequence of the prospective homologous polypeptide or 
fragment can be compared to one of the polypeptides of the Group B amino acid sequences, 
sequences substantially identical thereto, or a fragment comprising at least about 5, 10, 15, 
20, 25, 30, 35, 40, 50, 75, 100, or 150 consecutive amino acids thereof using any of the 

25 programs described herein. 

[000241] Another aspect of the invention is an assay for identifying fragments or variants of 
the Group B amino acid sequences, or sequences substantially identical thereto, which retain 
the enzymatic function of the polypeptides of the Group B amino acid sequences, and 
sequences substantially identical thereto. For example, the fragments or variants of the 

30 polypeptides, may be used to catalyze biochemical reactions, which indicate that said 

fragment or variant retains the enzymatic activity of the polypeptides in Group B amino acid 
sequences. 
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[000242] The assay for determining if fragments of variants retain the enzymatic activity of 
the polypeptides of the Group B amino acid sequences, and sequences substantially identical 
thereto includes the steps of: contacting the polypeptide fragment or variant with a substrate 
molecule under conditions which allow the polypeptide fragment or variant to function, and 
detecting either a decrease in the level of substrate or an increase in the level of the specific 
reaction product of the reaction between the polypeptide and substrate. 
[000243] The polypeptides of the Group B amino acid sequences, sequences substantially 
identical thereto or fragments comprising at least 5, 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, or 
1 50 consecutive amino acids thereof may be used in a variety of applications. For example, 
the polypeptides or fragments thereof may be used to catalyze biochemical reactions. In 
accordance with one aspect of the invention, there is provided a process for utilizing a 
polypeptide of the Group B amino acid sequences, and sequences substantially identical 
thereto or polynucleotides encoding such polypeptides for hydrolyzing aminonitriles. In such 
procedures, a substance containing a haloalkane compound is contacted with one of the 
polypeptides of the Group B amino acid sequences, and sequences substantially identical 
thereto under conditions which facilitate the hydrolysis of the compound. 
[000244] Antibodies - The polypeptides of Group B amino acid sequences, sequences 
substantially identical thereto or fragments comprising at least 5, 10, 15, 20, 25, 30, 35, 40, 
50, 75, 100, or 150 consecutive amino acids thereof, may also be used to generate antibodies 
which bind specifically to the enzyme polypeptides or fragments. The resulting antibodies 
may be used in immunoaffinity chromatography procedures to isolate or purify the 
polypeptide or to determine whether the polypeptide is present in a biological sample. In 
such procedures, a protein preparation, such as an extract, or a biological sample is contacted 
with an antibody capable of specifically binding to one of the polypeptides of the Group B 
amino acid sequences, sequences substantially identical thereto, or fragments of the foregoing 
sequences. 

[000245] In immunoaffinity procedures, the antibody is attached to a solid support, such as 
a bead or column matrix. The protein preparation is placed in contact with the antibody 
under conditions under which the antibody specifically binds to one of the polypeptides of the 
Group B amino acid sequences, sequences substantially identical thereto, or fragments 
thereof. After a wash to remove non-specifically bound proteins, the specifically bound 
polypeptides are eluted. 
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[0002461 The ability of proteins in a biological sample to bind to the antibody may be 
determined using any of a variety of procedures familiar to those skilled in the art. For 
example, binding may be determined by labeling the antibody with a detectable label such as 
a fluorescent agent, an enzymatic label, or a radioisotope. Alternatively, binding of the 
5 antibody to the sample may be detected using a secondary antibody having such a detectable 
label thereon. Particular assays include ELISA assays, sandwich assays, radioimmunoassays, 
and Western Blots. 

[000247] The antibodies of the invention can be attached to solid supports and used to 
immobilize nitrilases of the present invention. Such immobilized nitrilases can be used, as 
1 0 described above, in industrial chemical processes for the conversion of nitriles to a wide 
range of useful products and intermediates. 

[000248] Polyclonal antibodies generated against the polypeptides of the Group B amino 
acid sequences, and sequences substantially identical thereto, or fragments comprising at 
least 5, 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, or 150 consecutive amino acids thereof can be 

1 5 obtained by direct injection of the polypeptides into an animal or by administering the 

polypeptides to an animal. The antibody so obtained will then bind the polypeptide itself. In 
this manner, even a sequence encoding only a fragment of the polypeptide can be used to 
generate antibodies which may bind to the whole native polypeptide. Such antibodies can 
then be used to isolate the polypeptide from cells expressing that polypeptide. 

20 [000249] For preparation of monoclonal antibodies, any technique which provides 

antibodies produced by continuous cell line cultures can be used. Examples include the 
hybridoma technique (Kohler and Milstein (1975), Nature, 256:495-497, the disclosure of 
which is incorporated herein by reference), the trioma technique, the human B-cell 
hybridoma technique (Kozbor et al. (1983), Immunology Today 4:72, the disclosure of which 

25 is incorporated herein by reference), and the EBV-hybridoma technique (Cole et al. (1985), in 
Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96, the disclosure of 
which is incorporated herein by reference in its entirety). 

[000250] Techniques described for the production of single chain antibodies (U.S. Pat. No. 
4,946,778, the disclosure of which is incorporated herein by reference in its entirety) can be 
30 adapted to produce single chain antibodies to the polypeptides of, for example, the Group B 
amino acid sequences, or fragments thereof. Alternatively, transgenic mice may be used to 
express humanized antibodies to these polypeptides or fragments. 
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[000251] Antibodies generated against a polypeptide of the Group B amino acid sequences, 
sequences substantially identical thereto, or fragments comprising at least 5, 10, 15, 20, 25, 
30, 35, 40, 50, 75, 100, or 150 consecutive amino acids thereof may be used in screening for 
similar polypeptides from other organisms and samples. In such techniques, polypeptides 
5 from the organism are contacted with the antibody and those polypeptides which specifically 
bind to the antibody are detected. Any of the procedures described above may be used to 
detect antibody binding. One such screening assay is described in "Methods for Measuring 
Cellulase Activities", Methods in Enzyniology, 160:87-1 16, which is hereby incorporated by 
reference in its entirety. 

10 Use of Whole Cells Comprising A Nucleic Acid 

[000252] The invention provides for the use of whole cells which have been transformed 
with nucleic acid (or an active fragment thereof) encoding one or more of the nitrilases of the 
invention. The invention also provides for the use of such a whole cell in performing a 
nitrilase reaction on a substrate. Therefore, this invention provides for methods of 

1 5 hydrolyzing a cyanohydrin or aminonitrile linkage using a whole cell comprising at least one 
nucleic acid or polypeptide disclosed herein (SEQ ID NOS: 1-386). For example, a whole 
cell which is stably transfected (the invention also encompasses transiently transfected or 
transformed whole cells) with a nucleic acid encoding a nitrilase is one aspect of the 
invention. Such a cell is useful as a reagent in a reaction mixture to act on a substrate and 

20 exhibit nitrilase activity. 

Sequence Analysis Software 
[000253] Percent identity or homology between two or more sequences is typically 
measured using sequence analysis software (e.g. t Sequence Analysis Software Package of the 

25 Genetics Computer Group, University of Wisconsin Biotechnology Center, Madison, WI). 
Such software matches similar sequences by assigning a percent identity or homology to 
various deletions, substitutions and other modifications. The term "percent identity," in the 
context of two or more nucleic acids or polypeptide sequences, refers to the percentage of 
nucleotides or amino acid residues that are the same when compared after being aligned for 

30 maximum correspondence over a designated region or comparison "window." Under some 
algorithms, a conservative amino acid substitution can be considered "identical" and a change 
at a wobble site of a codon can be considered "identical." 
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[000254J "Alignment" refers to the process of lining up two or more sequences to achieve 
maximal correspondence for the purpose of assessing the degree of identity or homology, as 
defined within the context of the relevant alignment algorithm. 

[000255J For sequence comparison, typically one sequence acts as a reference sequence, to 
5 which test sequences are compared. When using a sequence comparison algorithm, test and 
reference sequences are entered into a computer, subsequence coordinates are designated, if 
necessary, and sequence algorithm program parameters are designated for a particular 
algorithm. Default program parameters can be used, or alternative parameters can be 
designated. The sequence comparison algorithm then calculates the percent identity or 
1 0 homology for the test sequences relative to the reference sequence, based on the program 
parameters. 

[000256] A "comparison window", as used herein, is a segment of the contiguous positions 
in a nucleic acid or an amino acid sequence consisting of from 20 to 600, usually about 50 to 
about 200, more usually about 100 to about 150 nucleotides or residues, which may be 

1 5 compared to a reference sequence of the same or different number of contiguous positions 
after the two sequences are optimally aligned. Methods of alignment of sequences for 
comparison are well-known in the art. Optimal alignment of sequences for comparison can 
be conducted, e.g., by the local homology algorithm of Smith and Waterman (1981), Adv. 
AppL Math. 2:482, by the homology alignment algorithm of Needleman and Wunsch (1970), 

20 J. Mol Biol 48:443, by the search for similarity method of Pearson and Lipman (1988), Proc. 
Natl Acad. Sci. USA 85:2444-2448, by computerized implementations of these algorithms, or 
by manual alignment and visual inspection. Other algorithms for determining homology or 
identity include, for example, the BLAST program (Basic Local Alignment Search Tool, 
National Center for Biological Information), BESTFIT, FASTA, and TFASTA (Wisconsin 

25 Genetics Software Package, Genetics Computer Group, Madison, WI), ALIGN, AMAS 

(Analysis of Multiply Aligned Sequences), AMPS (Alignment of Multiple Protein Sequence), 
ASSET (Aligned Segment Statistical Evaluation Tool), BANDS, BESTSCOR, BIOSCAN 
(Biological Sequence Comparative Analysis Node), BLIMPS (BLocks IMProved Searcher), 
Intervals and Points, BMB, CLUSTAL V, CLUSTAL W, CONSENSUS, LCONSENSUS, 

30 WCONSENSUS, Smith-Waterman algorithm, DARWIN, Las Vegas algorithm, FNAT 

(Forced Nucleotide Alignment Tool), Frameahgn, Framesearch, DYNAMIC, FILTER, FSAP 
(Fristensky Sequence Analysis Package), GAP (Global Alignment Program), GENAL, 
GIBBS, GenQuest, ISSC (Sensitive Sequence Comparison), LALIGN (Local Sequence 
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Alignment), LCP (Local Content Program), MACAW (Multiple Alignment Construction and 
Analysis Workbench), MAP (Multiple Alignment Program), MBLKP, MBLKN, PIMA 
(Pattern-Induced Multi-sequence Alignment), SAGA (Sequence Alignment by Genetic 
Algorithm) and WHAT-IF. Such alignment programs can also be used to screen genome 
5 databases to identify polynucleotide sequences having substantially identical sequences. A 
number of genome databases are available, for example, a substantial portion of the human 
genome is available as part of the Human Genome Sequencing Project (J. Roach, 
http://weber.u.Washington.edu/-roach/human_ genome_ progress 2.html) (Gibbs, 1995). At 
least twenty-one other genomes have already been sequenced, including, for example, M 
10 genitalium (Fraser et al., 1995), M.jannaschii (Bult et aL, 1996), H. influenzae (Fleischmann 
et al., 1995), E. coli (Blattner et al, 1997), and yeast (S. cerevisiae) (Mewes et al., 1997), and 
D. melanogaster (Adams et al, 2000). Significant progress has also been made in 
sequencing the genomes of model organism, such as mouse, C. elegans, mdArabadopsis sp. 
Several databases containing genomic information annotated with some functional 
15 information aie maintained by different organizations, and are accessible via the internet, for 
example, http://wwwtigr.org/tdb; http://www.genetics.wisc.edu; http://genome- 
www.stanford.edu/~ball; http://hiv-web.lanl.gov; http://www.ncbi.nlm.nih.gov; 
http://www.ebi.ac.uk; http://Pasteur.fr/other/biology; and http://www.genome.wi.mit.edu. ; 
[000257] Examples of useful algorithms are the BLAST and the BLAST 2.0 algorithms, 
20 which are described in Altschul et al. (1977), Nuc. Acids Res. 25:3389-3402, and Altschul et 
, al. (1990), J. Mol Biol. 215:403-410, respectively. Software for performing BLAST 
analyses is publicly available through the National Center for Biotechnology Information 
(http://www.ncbi.nlm.nih.gov/). This algorithm involves first identifying high scoring 
sequence pairs (HSPs) by identifying short words of length W in the query sequence, which 
25 either match or satisfy some positive-valued threshold score T when aligned with a word of 
the same length in a database sequence. T is referred to as the neighborhood word score 
threshold (Altschul et al., supra). These initial neighborhood word hits act as seeds for 
initiating searches to find longer HSPs containing them. The word hits are extended in both 
directions along each sequence for as far as the cumulative alignment score can be increased. 
30 Cumulative scores are calculated using the parameter M (reward score for a pair of matching 
residues; always >0). For amino acid sequences, a scoring matrix is used to calculate the 
cumulative score. Extension of the word hits in each direction are halted when: the 
cumulative alignment score falls off by the quantity X from its maximum achieved value; the 
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cumulative score goes to zero or below, due to the accumulation of one or more negative- 
scoring residue alignments; or the end of either sequence is reached. The BLAST algorithm 
parameters W, T, and X determine the sensitivity and speed of the alignment. For nucleotide 
sequences, the BLASTN program uses as defaults a wordlength (W) of 1 1, an expectation (E) 
of 10, M=5, N=-4 and a comparison of both strands. For amino acid sequences, the BLASTP 
program uses as defaults a wordlength (W) of 3, an expectation (E) of 10, and the 
BLOSUM62 scoring matrix (see Henikoff and Henikoff (1 989), Proc. Natl Acad Sci. USA 
89:10915). 

[000258] The BLAST algorithm also performs a statistical analysis of the similarity 
between two sequences (see, e.g., Karlin and Altschul (1993), Proc. Natl Acad. Set USA 
90:5873). One measure of similarity provided by BLAST algorithm is the smallest sum 
probability (P(N)) S which provides an indication of the probability by which a match between 
two nucleotide or amino acid sequences would occur by chance. For example, a nucleic acid 
is considered similar to a references sequence if the smallest sum probability in a comparison 
of the test nucleic acid to the reference nucleic acid is Jess than about 0.2, less than about 
0.01, or less than about 0.001. 

[000259] In one aspect, protein and nucleic acid sequence homologies are evaluated using 
the Basic Local Alignment Search Tool ("BLAST"). In particular, five specific BLAST 
programs are used to perform the following task: 

(1) BLASTP and BLAST3 compare an amino acid query sequence against 
a protein sequence database; 

(2) BLASTN compares a nucleotide query sequence against a nucleotide 

sequence database; 

(3) BLASTX compares the six-frame conceptual translation products of a 
query nucleotide sequence (both strands) against a protein sequence database; 

(4) TBLASTN compares a query protein sequence against a nucleotide 
sequence database translated in all six reading frames (both strands); and 

(5) TBLASTX compares the six-frame translations of a nucleotide query 
sequence against the six-frame translations of a nucleotide sequence database. 

[000260] The BLAST programs identify homologous sequences by identifying similar 
segments, which are referred to herein as "high-scoring segment pairs," between a query 
amino or nucleic acid sequence and a test sequence which may be obtained from a protein or 
nucleic acid sequence database. High-scoring segment pairs are identified (i.e., aligned) by 
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means of a scoring matrix, many of which are known in the art. In one example, the scoring 
matrix used is the BLOSUM62 matrix (Gonnet et al. (1992), Science 256:1443-1445; 
Henikoff and Henikoff (1993), Proteim 17:49-61). In another example, the PAM or 
PAM250 matrices may also be used (see, e.g., Schwartz and Dayhoff, eds. (1978), Matrices 
5 for Detecting Distance Relationships: Atlas of Protein Sequence and Structure, Washington: 
National Biomedical Research Foundation). BLAST programs are accessible through the 
U.S. National Library of Medicine, e.g., at www.ncbi.nlm.nih.gov. 

[000261] The parameters used with the above algorithms may be adapted depending on the 
sequence length and degree of homology studied. In some aspects, the parameters may be 

10 the default parameters used by the algorithms in the absence of instructions from the user. 
[000262] In a particular aspect, the invention provides a method for modifying small 
molecules, comprising contacting a polypeptide encoded by a polynucleotide described 
herein or enzymatically active fragments thereof with a small molecule to produce a modified 
small molecule. A library of modified small molecules is tested to determine if a modified 

15 small molecule is present within the library which exhibits a desired activity. A specific 
biocatalytic reaction which produces the modified small molecule of desired activity is 
identified by systematically eliminating each of the biocatalytic reactions used to produce a 
portion of the library, and then testing the small molecules produced in the portion of the 
library for the presence or absence of the modified small molecule with the desired activity. 

20 The specific biocatalytic reactions, which produce the modified small molecule of, desired 
activity is optionally repeated. The biocatalytic reactions are conducted with a group of 
biocatalysts that react with distinct structural moieties found within the structure of a small 
molecule, each biocatalyst is specific for one structural moiety or a group of related structural 
moieties; and each biocatalyst reacts with many different small molecules which contain the 

25 distinct structural moiety. 

[000263] Some aspects of the use of the nitrilases are: 

q-hvdroxv acid - Nitrilases produce cc-hydroxy acids through hydrolysis of 
cyanohydrins. Production of mandelic acid and derivatives thereof is an example of this. A 
significant application of this type involves commercial production of (7^-mandelic acid in 

30 both high yield and high enantioselectivity from mandelonitrile. Mandelic acid and 
derivatives have found broad application as intermediates and resolving agents for the 
production of many chiral pharmaceutical and agricultural products. Previous attempts to 
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employ the few known nitrilases in processes using analogous substrates have been plagued 
by significantly lower activity, productivity, and selectivity. 
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Phenvllactic acid derivatives 

[000264] An additional application is in the production of (o>phenyl lactic acid derivatives 
in both high yield and high enantioselectivity. Phenyl lactic acid derivatives have found 
broad application in the production of many chiral pharmaceutical and agricultural products. 
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97% 96% 97% 



g-hvdroxy acid 

[000265] With important commercial considerations, nitrilases are provided produce 
15 either enantiomer of 4-cyano-3-hydroxybutyric acid, the (Xj-enanatiomer of which is a key 
intermediate in the synthesis of the drug LIPITOR™. 
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OH SEQ ID NOS: 109,110 OH 
NC . PN - NC ^^v^-COO H 



hydroxyglutaryl nitrile 



Selected 
N itrilases 



QH 

NC ^ ^COO H 



(R)-3-hydroxy-4-cya no-butyric acid 



(S)-3-hydroxy-4-cya no-butyric acid 




LiPITOR™ 



[000266} The following nitrilases are more examples of nitrilases useful in converting 
hydroxyglutarylnitrile to (R)-3-hydroxy-4-cyano-butyric acid: SEQ ID NOS:205, 206, SEQ 

5 ID NOS:207, 208, SEQ ID NOS:195, 196, SEQ ID NOS:43, 44, SEQ ID NOS:321, 322, and 
SEQ ID NOS:237, 238. The above schematic indicates that "selected nitrilases" can be used 
to convert hydroxyglutarylnitrile to (S)-3-hydroxy-4-cyano-butyric acid: SEQ ID NOS:107, 
108, SEQ ID NOS:109, 110, SEQ IDNOS:lll, 112, SEQIDNOS:127, 128, SEQ ID 
NOS:129, 130, SEQIDNOS:133, 134, SEQ ID NOS: 113, 114, SEQ ID NOS:145, 146, SEQ 

10 IDNOS:101, lO^SEQIDNOS:^, 180,SEQIDNOS:201,202, SEQ ID NOS: 159, 160, 
SEQIDNOS:177, 178, SEQ ID NOS:181, 182, SEQIDNOS:183, 184, SEQ ID NOS:185, 
186, SEQ ID NOS:57, 58, SEQ ID NOS:197, 198, SEQ ID NOS:59, 60, SEQ ID NOS:67, 
68, and SEQ ID NOS:359, 360. 

[000267] The invention will be further described with reference to the following examples; 

15 however, it is to be understood that the invention is not limited to such examples. Rather, in 
view of the present disclosure which describes the current best mode for practicing the 
invention, many modifications and variations would present themselves to those of skill in 
the art without departing from the scope and spirit of this invention. All changes, 
modifications, and variations coming within the meaning and range of equivalency of the 

20 claims are to be considered within their scope. 

EXAMPLES 
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[000268J For each library to be screened for nitrilases, an infection was set up as follows: 
5ml of an ODeoonni^i resuspension of SEL700 cells and 1ml of the phagemid library to be 
screened were combined. The combination was incubated in a 37°C waterbath for 45 min. 

5 [000269] Using the infection, serial dilutions were made in 1 OmM MgS0 4 , using 1 Opl 
aliquots of the infection. 

titer of library dilutions to make 

~10 5 cfu/ml 10* 1 dilution 

~10 6 cfu/ml 10'\ 10- 2 dilution 

10 ~10 7 cfu/ml 10'\ 10" 2 , 10" 3 dilution 

[000270] 60pl of each of the following dilutions were deposited onto a small LB-kan 50 
plate: 

titer of library dilutions to make 

15 ~10 5 cfu/ml undiluted infection, 10" 1 dilution 

~10 6 cfii/ml 10" 1 , 10" 2 dilutions 

-1 0 7 cfu/ml 1 0" 2 , 1 0' 3 dilutions 

[000271] The cells in the infection were centrifuged in a tabletop centrifuge at 4°C, 4.6k 
20 rpm, 10 min to form pellets. The supernatant was decanted from the resulting pellets. The 
cells were resuspended in residual liquid. All of the resuspended cells were deposited onto a 
single large LB-kan 50 plate. All plates were incubated at 30°C overnight. 

Example 2: Selection Screenings 

[000272] The cells of each infection plate were resuspended with ~4mls 1 OmM MgS04. 

25 The resuspensions were placed in a tube. The remaining cells on each plate were 

resuspended with ~3 mis lOmM MgSCU and combined with the first resuspension from the 
same plate. The volume of each tube was brought to 12ml with IQmM MgS0 4 , The tubes 
were vortexed vigorously. The tubes were centrifuged in a tabletop centrifuge at 4°C and 
4.6k for lGmin to form pellets. The supernatant was decanted from each resuspension. The 

30 washed cells in each tube were resuspended with 1 0ml lOmM MgSO/t. The resuspensions 
from each library were stored at 4°C until the selection cultures were ready to be set up. 

[000273] For each resuspension, selection cultures were set up using the following process: 
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1) The nitrilase selection medium was prepared, using: 1XM9 medium with 0.2% 
glucose, no nitrogen and 50pg/ml kanamycin (for pBK phagemid libraries only; 
use ampicillin for pBS libraries). 

2) 5ml of the medium was aliquoted into a 50ml screw top conical tube. 
5 3) 25plofthe stored resuspension was added to the tube. 

4) 5pl of adiponitrile was added to the tube, to bring the final concentration to 
8.8mM. Additional nitrile substrates may be used, in place of adiponitrile. 

5) The resulting combination was cultured at 30°C. 

10 Steps 1-5 were repeated for each nitrile substrate. 

Example 3: Isolation of a positive nitrilase clone from selection cultures 

1000274] Ten (10) pi of selection culture with growth was streaked out onto a small LB- 
kan 50 plate and allowed to grow for 2 nights at 30°C. Five isolated cfu were picked and each 
was grown in 2ml nitrilase selection medium at 30°C. Each culture was monitored (where 

15 growth indicates positive cfu was picked), and was removed when monitoring indicated that 
it was in a stationary phase of growth. One (1) ml of culture was used to do a plasmid 
preparation and was eluted with 40pl elution buffer. Five to eight (5-8) pi DNA was cut with 
Pst I/Xho I or Sac I/Kpn I restriction enzymes to remove insert from vector. A restriction 
fragment length polymorphism (RFLP) determination was carried out to identify the size of . 

20 the insert. The insert was sequenced. 

Example 4: Screenine and Characterization of Nitrilases 

[000275] Nitrilases of the invention were screened against target substrates. Of those 
showing hydrolytic activity in a primary screen, enzymes with enantioselectivities above 20% 
enantiomeric excess (ee) were selected for further characterization. Those enzymes were 

25 selected based on: 1) having activity against one of the substrates of interest and 2) 
exhibition of greater than 35% ee (enantiomeric excess). The results of this screening 
process are set forth in Table 1 above. The products used for screening were: D- 
Phenylglycine, L-Phenyllactic acid, (R) 2-chloromandelic acid, OS)- Cyclohexylmandelic 
acid, L-2-methylphenylglycine, (S)-2-amino-6-hydroxy hexanoic acid, and 4-methyl-L- 

30 leucine. 
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CN 




H 2 N V H 



NH 2 



COOH 



Phenylglycinonitrile D-phenylglycine 

[000276] The hydrolysis of phenylglycinonitrile was performed. Some of these enzymes 
5 showed an ee higher than 20% and those were selected for preliminary characterization. 

[000277] Based on the preliminary characterization experiments, a number of putative hits 
were identified on phenylglycinonitrile and a large amount of data was accumulated on these 
enzymes. The data revealed many common properties: the majority of the enzymes had pH 
optima for activity at pH 7 and, in general, the enantioselectivity was enhanced at the lower 

10 pH values. The enzymes were found to be more active at higher temperature, particularly 
38°C, although this temperature often resulted in lower enantioselectivities. The use of 
water-miscible co-solvents in the reaction was shown to be a practical option. The inclusion 
of 10-25% methanol (v/v) in the enzyme reactions did not substantially affect enzyme 
activity and in many cases, led to an increase in enantioselectivity. The use of biphasic 

1 5 systems has also shown some promise, with the enzymes maintaining their level of activity 
with the addition of up to 70% (v/v) of hexane and, in some cases, toluene. The use of ethyl 
acetate in the biphasic systems, however, led to lower activity. 

[000278] Of the enzymes identified active on phenylglycinonitrile, the enantioselectivity of 
several enzymes was shown to remain above the success criterion of 35% ee. The 

20 preliminary characterization data indicated that some of the enzymes exhibited high 

enantioselectivities for D-phenylglycine, with corresponding conversion to product of 40- 
60%. Further investigation suggested that the rate of activity of some of these enzymes was 
faster than the rate of racemization of the substrate. Reducing the concentration of enzyme 
led to improved enantioselectivity; therefore, it appears that some benefit could be gained by 

25 control of the relative rates of the chemical racemization and the enzyme activity. 
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OH 



HO H 



2-chloromandelonitrile (R)-2-chloromandelic acid 




CN 




COOH 



[000279] 



Enzymes were identified which showed activity on 2- 



5 chloiomandelonitrile. A high degree of overlap existed between the enzymes which were 
active on 2-chloromandelonitrile and phenylglycinonitrile. Many of these enzymes also 
formed a distinct sequence family. 



highest activity for the active enzymes. For the majority of the nitrilases, the 
10 enantioselectivity also increased at higher temperatures, particularly 38°C. The enzymes 

retained their activity in the presence of up to 25% methanol or 10% isopropanol; in many of • 
these cases, the enantioselectivity was also enhanced. Activity in biphasic systems was 
largely comparable to aqueous conditions, particularly with hexane as the non-aqueous phase; 
varying tolerances to toluene were observed between the different nitrilases. 
15 Table 2. Summary of optimal conditions determined from characterization experiments 
for enantioselective hydrolysis of 2-chloromandelonitrile. 



[000280] 



Higher temperatures and neutral pH appeared to lead to the 
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SEQ ID 
NOS: 


Optimum pH 


Optimum 
Temp °C 


Solvent Tolerance 


385, 386 


7 


38 


25%MeOH 


169, 170 


5 


38 


1PA 


185, 186 


7 


38 


1PA 


47,48 


7 


38 


10%MeOH 


197, 198 


6 


55 


25%MeOH, 10% 
IPA 


A 187,188 


7 


38 


10%MeOH; 40% 
IPA 


1 217,218 


7 


38 


25%MeOH,10% 
IPA,70%hexane, 
40% toluene 


| 55, 56 


7 


38 


10%MeOH,IPA, 
70%hexane 


167,168 


9 


38 


10%MeOH, IPA, 
70%hexane 


15, 16 


7 


38 


25%MeOH, 10% 
IPA, 70%hexane, 
40% toluene 



Screening of nitrilases against target su bstrate (SVphenvllactic acid: 



ens; 



Phenylacetaldehyde 
cyanohydrin 



.COOH 
HO H 

(S)-Phenyllactic acid 




[000281] Many of the nitrilases tested were active on phenaylacetaldehyde cyanohydrin. 
Many of these enzymes were part of two related sequence families and were distinct from 
those enzymes that were active on phenylglycinonitrile and chloromandelonitrile. 

[000282] The pH optima of the enzymes was generally above pH 7 (i.e. pH 8 or 9), with 
10 higher enantioselectivities being exhibited at these levels. Most of the enzymes showed 
superior activity at higher temperature, particularly 38°C. The effect of temperature on the 
enantioselectivities of the enzymes varied; in most cases, this property was slightly lower at 
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higher temperatures. While the enzymes were tolerant towards the addition of co-solvents, 
particularly 10% (v/v) methanol, no advantage in activity or enantioselectivity was gained by 
such additions. The use of a biphasic system was again shown to be feasible. 
[000283] Table 3. Summary of optimal conditions determined from characterization 
5 experiments for enantioselective hydrolysis of phenylacetaldehyde cyanohydrin 



SEQIDNOS: 


Optimum pH 


Optimum Temp °C 


Solvent Tolerance 1 


103, 104 


7 


55 


10% MeOH, EPA 


99, 100 


8 


38 


10%MeOH, 70% 
hexane, toluene 


183, 184 


9 


38 


1ft%MeOH IP A. 
70% toluene, 
hexane 


173, 174 


5 


38-55 


9S%A/feOH TP A 
70% hexane, 
toluene 


211 214 


7 


38 


10%MeOH, 25% 
IP A, 70% hexane, 
toluene 


61, 62 


7 


38 


10% MeOH, 70% 
hexane, toluene 


205, 206 


8 


5 e- j j 


10%MeOH,IPA, 
40% hexane, 
toluene 


207, 208 


8 


38 


10%MeOH,70% 
hexane 


309, 210 


8 


38 


10%MeOH,40% 
hexane, toluene 


195, 196 


8 


38 


10% MeOH, 40% 
hexane, toluene 


43,44 


9 


38 


10% MeOH, 40% 
hexane 


161, 162 


9 


38 


25%MeOH,IPA, 
10% hexane, 
toluene 


175, 176 


6 


38-55 


10% MeOH, IPA, 
40% hexane 
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SEQIDNOS: 


Optimum pH 


Optimum Temp °C 


Solvent Tolerance 


293, 294 


6 


38 


10%MeOH,IPA, 
40% hexane 



Screening of nitrilases against target substrate L-2-methvlphenvl glycine 




2-methylphenylglycinonitrile L-2-methylphenylglycine 

5 

[000284] Nitrilases have shown activity on this substrate and preferentially yielded the D-2- 
methylphenylglycine, rather than the required L-2-methylphenylglcyine. 

Screening of nitrilases against target substrate L-hydroxvnorleucine ((SV2-amino-6-hvdroxv 
hexahoic acid) 

NH 2 H 2 N H 

5-hydroxypentanal L-hydroxynorleucine 
10 aminonitrile 

[000285] A number of nitrilases, which showed activity on 2-amino-6-hydroxy 
hexanenitrile, were isolated. All of these enzymes showed enantioselectivity towards the L- 
isomer of the product. 

[000286] The enzymes all showed higher enantioselectivities at higher pH and appeared to 
1 5 more susceptible to the addition of solvents than the other nitrilases tested. Although activity 
was detected in the presence of organic solvents, it was generally lower than that of the 
aqueous control. Once again, the activity of the enzymes was negatively affected by the acid 
product and aldehyde starting material. 

Table 4. Summary of optimal conditions determined from characterization experiments for 
20 enantioselective hydrolysis of 2-amino-6-hydroxy hexanenitrile. 
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SEQIB 
NOS: 


Optimum 
P H 


Optimum 
Temp°C 


Solvent 


217,218 


9 


38 


10%MeOH 


55, 56 


9 


38 


None 


187,188 


9 


38 


10% MeOH 


167, 168 


9 


38 


None 


221,222 


9 


38 





[000287] A range of hydrolytic activities was observed among the confirmed hit enzymes 
for 2-amino-6-hydroxy hexanenitrile. 

Screening of nitrilases against target substrate 4-methvl-D-leucine and 4-methvl-L-leucine 



CN 



NH 2 

3,3-dimethylbutanal 
aminonitrile 



,COOH 



-XX 

H 2 N H 
4-methyl-L-leucine 

XTX COOH 

H 2 Sf H 
4-methyl-D-leucine 

[000288] Hydrolysis of 2-amino-4,4-dimethyl pentanenitrile was performed by several of 
the nitrilases. Of these, some were shown to hydrolyse the nitrile to the L-isomer of the 
corresponding acid and were selected for further characterization. 

Table 5. Summary of optimal conditions determined from characterization experiments for 
enantioselective hydrolysis of 2-amino-4,4-dimethyl pentanenitrile 



SEQID 
NOS: 


Optimum pH 


Optimum 
Temp°C 


Solvent Tolerance 


103, 104 


7 


23 


25% MeOH, 10% EPA 


59, 60 


8 


23 


25% MeOH 


221,222 


6 


38 


25% MeOH, 10%IPA I 



Screening of nitrilases against target substrate fSVcvclohexvlmandelic acid 




Cyclohexylmandelonitrile (S)-cyclohexytmandelic acid 
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Screening of nitrilases against target substrate Mandelonitrile 




CN 




HO H 



OH 



COOH 



Mandelonitrile 



(R)-mandelic acid 



5 [000289] The nitrilase collection was also screened on mandelonitrile. The nitrilases 
actively hydrolyzed both phenylglycinonitrile and chloromandelonitrile. 

Enzymatic assay for determination of enantioselectivitv 

[000290] In the design of a spectroscopic system for determination of the chiral ct-hydroxy 
acids and a-amino acids, an enzyme based assay which permits the detection of product 

10 formation and enantioselectivity was developed and used. 

[000291] Spectroscopic systems for the detection of a-hydroxy- and for a-amino- acids 
based on lactate dehydrogenase (L-LDH & D-LDH) and on amino acid oxidase (L-AA Oxid 
& D-AA Oxid) are described in Figures 6 and 7. These enzymes were chosen because they 
are reported to have reasonably broad substrate ranges while still retaining near absolute 

1 5 enantiospecificity. 

[000292] The overall feasibility of this system has been established (Table 12). Neither the 
parent hydroxynitrile nor the aminonitrile is metabolized by the secondary or detection 
enzyme and thus starting material does not interfere. Cell lysate which is not heat treated 
results in background activity for the LDH system; however, heat inactivation eliminates the 

20 background activity. Cell lysate does not appear to interfere in the AA Oxidase assay. One 
concern is the inactivation of the AA Oxidase, which utilizes a FMN co-factor, by residual 
cyanide. However, the control studies indicated that at 2 mM PGN (which could release up to 
2 mM HCN) inactivation is not a problem. This assay is suitable for automation of 384 well 
(or possibly greater density) microtiter plates. 

25 Table 6: Summary of Identification of Secondary Enzyme to Chiral Detection of Acid 



Product. 
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SUBSTRATE 



ENZYME WITH SUITABLE 
ACTIVITY FOUND FROM 
COMMERCIAL SOURCE 



Hydroxy Acid Products: 



L-lactic acid 



D-lactic acid 



L-phenyl lactic acid_ 



D-phenyl lactic acid 



S-cyclohexylmandelic acid 
R-cycIohexylmandelic acid 



YES 



YES 



YES 



YES 



Not applicable 



Not applicable 



Amino Acid Products: 



4-methyl-L-leucine 



4-methyl-L/D-leucine 



D-phenylalanine 



R-phenylglycine 



L-homophcnyllactic acid 



D-homophenyllactic acid 



L-homophenylalanine^ 



P-homophenylalanine 



(S)-2-amino-6-hydroxy 
hexanoic acid 



(R/S)-2-amino-6-hydroxy 
hexanoic acid 



L-methylphenylglycine 



D-methylphenylglycine 



YES 



YES (P-unknown) 



YES 



YES 



YES 



YES 



YES 



YES 



YES 



YES (D-unknown) 



Not Applicable 



Not Applicable 



> : Tbe assay will not be applicable to cyclohexybnandelic acid and 2-metbylphenylglycine, as tertiary alcohols 
are not amenable to this particular oxidation 

5 Example 5: Standard assay conditions 

[000293] The following solutions were prepared: 

• Substrate stock solution: 50 mM of the aminonitrile substrate in 0.1 M phosphate 
buffer (pH 7) or 50 mM of the cyanohydrin substrate in 0.1 M Na Acetate buffer (pH 5) 

. Enzyme stock solution: 3.33 ml of 0.1 M phosphate buffer (pH 7) to each vial of 20 
10 mg of lyophilized cell lysate (final concentration 6 mg protein/ml) 



15 



[000294] Procedure: 

• Add 100 nl of the 50 mM substrate solution to the appropriate number of wells of a 

96-well plate 

• Add 80 \x\ of buffer to each well 

• Add 20 ^1 of enzyme solution to each well 

• Blank controls were set up by substitution of 20 \il of buffer for the enzyme solution 
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• Negative controls consisting of 20 jil of enzyme solution in 1 80 of buffer were 
also included in many of the experiments. Once it had been established that the cell lysate 
did not interfere with the detection of the products, these controls were not included. 
[000295] Sampling of reactions: 

5 • The reactions were sampled by removing an aliquot from each well (15-50 and 

diluting the samples as follows: 

• Samples for non-chiral HPLC analysis: 

• Phenylglycine, 2-chloromandelic acid and phenyllactic acid: initially, the samples 
were diluted 2-fold with water and a further 2-fold with methanol or acetonitrile (final 

10 dilution: 4-fold). It was found that an 8-fold dilution of these samples led to improved 
chromatographic separation 

• (S)-2-amino-6-hydroxy hexanoic acid, 4-methylleucine, /-leucine, 2- 
methylphenylglycine and cyclohexylmandelic acid: samples were diluted 1:1 with methanol 
or acetonitrile. The choice of solvent was based on the solvent used in the HPLC analysis 

15 method. 

[000296] -Samples for chiral HPLC analysis: 

• Phenylglycine, 2-chloromandelic acid and phenyllactic acid: as described above for 
the non-chiral analyses, the samples for chiral analyses were initially diluted 2-fold and in the 
later stages of the project, at 4-fold. 

20 • (S)-2-amino-6-hydroxy hexanoic acid, 4-methylleucine, /-leucine, 2- 

methylphenylglycine: samples were diluted 1:1 with methanol or acetonitrile. 
[000297] • For each experiment, a standard curve of the product was included in the HPLC 
run. The curve was plotted on an X-Y axis and the concentration of product in the samples 
calculated from the slope of these curves. 

25 [000298] • For the preliminary characterization experiments, samples were taken such that 
the activity of the enzymes was in the linear phase; this was performed so that differences in 
the effects of the parameters on the rate of reaction, rather than the complete conversion, 
could be determined. The sampling times are denoted in the tables included in the text. 
[000299] • The samples were analyzed by HPLC, using the methods outlined in Table 20 

JO and 21. 
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Example 6: Determination of the Effect of pH on enzyme activity and enanti oselectivitv 

[000300] The effect of pH on the enzyme activity and enantioselectivity was studied by 
performance of the standard assay in a range of different buffers: 

0.1 M Citrate Phosphate pH 5 
5 0.1 M Citrate Phosphate pH 6 

0.1 M Sodium Phosphate pH 7 

0.1MTris-HClpH8 

MTris-HClpH9 

10 [000301] The samples were analyzed by non-chiral and chiral HPLC methods and examples 
of the results are presented in Tables 5, 8 and 1 1 herein. 

Example 7: Determination of the effect of temperature on enzyme activity and 
enantioselectivity 

15 [000302] The effect of temperature on the activity and enantioselectivity was investigated 
by performing the standard assay at room temperature, 38°C and 55°C. The samples were 
analyzed by non-chiral and chiral HPLC methods and examples of the results are given in 
Tables 5, 8 and 11 herein. 

Example 8: Determination of the Effect of solvents on enzyme activity and enantioselectivity 

20 [000303] The enzyme reactions were performed in the presence of cosolvents and as 

biphasic systems, in order to investigate the effect of water-miscible and water-immiscible 
solvents on the enzymes. In the presence of cosolvents, the reactions were run under 
standard conditions, with substitution of the buffer with methanol or isopropanol. The final 
concentrations of solvent in the reactions was 0, 10, 25 and 40% (v/v). 

25 [000304] The biphasic reactions were also carried out under standard conditions, with a 
layer of water-immiscible organic solvent forming the nonaqueous phase. The solvent was 
added at the following levels: 0%, 10%, 40% and 70% (v/v) of the aqueous phase. The 
samples from these reactions were evaporated by centrifugation under vacuum and 
redissolved in a 50:50 mixture of methanol or acetonitrile and water. The samples were 

30 analyzed by non-chiral and chiral HPLC methods. 
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Activity 

[000305] The effect of the process components on the activity of the enzymes was 
5 established by addition of the individual components to the enzymatic reaction. These 
components included the starting materials for the nitrile synthesis, aldehyde, cyanide and 
ammonium, as well as triethylamine, which is added in catalytic amounts to the nitrile 
synthesis reaction. The concentrations of the reactants were selected with possible process 
conditions in mind and were adapted to the levels of reactants used in the enzyme assays. In 
10 some cases, the solubility of the aldehydes and products was relatively low; in these cases, 
the highest level of solubility was added to the reactions as the highest level and 10% of this 
level as the lower value. 

[000306] The enzymatic reactions were carried out under standard conditions, with addition 
of one or more of the following components: benzaldehyde, phenylglycine, 
15 phenylacetaldehyde, phenyllactic acid, 2-chlorobenzaldehyde, 2-chloromandelic acid, 5- 
hydroxypentanal, (S)-2-amino-6-hydroxy hexanoic acid, 4-methylleucine, KCN, 
Triethylamine, NH 4 C1. Control reactions were performed under standard conditions, with no 
additive. The samples were analyzed by non-chiral HPLC. 
Stability 

20 [000307] The stability of the enzymes to process conditions was monitored by incubation of 
the enzymes in the presence of the individual reaction components for predetermined time 
periods, prior to assay of the enzyme activity under standard conditions. In these 
experiments, the enzymes were incubated at a concentration of 1.2 mg protein/ml in the 
presence of each of the following reaction components: methanol, benzaldehyde, 

25 phenylglycine, phenylacetaldehyde, phenyllactic acid, 2-chlorobenzaldehyde, 2- 

chloromandelic acid, 5-hydroxypentanal, (S)-2-amino-6-hydroxy hexanoic acid, KCN, 
NH4CI. 

Assay conditions: 

[000308] At 0, 2, 6 and 24 hours of incubation in the particular additive, 50 ^1 of the 
30 enzyme solution was removed, 50 jil of a 50 mM substrate stock solution added and the 
enzyme activity assayed under standard conditions. After substrate addition, the reactions 
were sampled at the following times: Phenylglycinonitrile: 10 mins; Phenylacetaldehyde 
cyanohydrin: 1 hour; 2-chloromandelonitrile: 2 hours. Control reactions were performed by 
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incubation of the enzyme in buffer only. The samples were analyzed using non-chiral HPLC 
methods. 

Example 10: Confirmation of putative hit enzymes 

[000309] Following the preliminary characterization experiments, the enzymes which were 
5 identified as putative hits were assayed under the optimal conditions determined, in order to 
evaluate their performance, especially in terms of enantioselectivity, when higher conversions 
were attained. The enzymes were assayed with 25 mM substrate, under the conditions of pH 
and temperature noted in the tables included in the text. A standard concentration of 0.6 
mg/ml protein was used for each of the enzymes, unless otherwise stated. 

10 Example 1 1 : Selected examples of chromatograms from enzyme reactions 

[000310] In this section, representative examples of chromatograms for each substrate and 
product combination will be shown, together with a discussion of some of the challenges 
encountered with the methods and how they were addressed. 
D-Phenvl glycine 

15 [00031 1] Non-chiral analysis showing the substrate peak eluting at 2.6 min and 3.2 min. 
See Figures 8A-8E. The two peaks were present in all samples containing higher 
concentrations of the nitrite; the second peak is thought to be a product associated with the 
nitrile; it decreased with time and was no longer present once complete conversion to the 
product had taken place. The chromatogram shown in Figure 8 A is a blank control, 

20 containing only nitrile and buffer; the samples were all diluted with water and solvent as 
explained in section 1 above. This was repeated for all samples discussed below. An 
enzymatic reaction sample is shown in the chromatogram in Figure 8B , with the product 
eluting at 0.4 min. 

[000312] Of note in these chromatograms is the small solvent front peak eluting at 0.3 min, 
25 Further representation of this peak is given in the chromatogram shown in Figure 8C, in 
which a negative control consisting of cell lysate in buffer, was run. A very small peak 
coeluted with the product at 0.4 min. In the initial phase of the project, this peak was 
regarded as problematic, although the appropriate controls were run with each experiment for 
in order to maintain accuracy. In these experiments, the peak area resulting from the cell 
30 lysate, although it was relatively small, was subtracted from the peak areas of the product in 
the enzymatic reactions. Improvement of this analysis was obtained by further dilution of the 
samples and the use of lower injection volumes on the HPLC. Following the implementation 
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of these improvements, interference by this peak was shown to be minimal, as shown in the 
chromatogram illustrated in Fig. 6C 

[000313] The chiral analysis of phenylglycine is shown in chromatogram in Fig. 6D with 
the L-enantiomer eluting at 6 min and the D-enantiomer at 1 1 min. Good resolution between 
5 the two isomers was obtained. However, the column used was very sensitive and the 

characteristics of the column appeared to change over time, resulting in changes in the elution 
times of the acids. While this was easily detected by the use of the proper controls and 
standards, a greater problem existed in the coelution of the nitrile peak with the D-enantiomer 
(chromatogram shown in Fig. 6E). The cause of this coelution was unclear; however, it was 
10 easily detected by the use of appropriate standards; in addition, the UV spectrum of the acid 
was very distinctive, making the use of this tool effective in detecting the coelution. The 
problem was also easily resolved by adjusting the methanol content in the mobile phase. 
(i?V2-chloromandelic acid 

[000314] The HPLC analysis of chloromandelic acid and chloromandelonitrile offered 
15 many of the challenges associated with the analysis of the phenylglycine samples. From the 
chromatogram shown in Fig. 7A, which contains only chloromandelonitrile in buffer, it is 
evident that a peak eluted at the same time as the product in the chromatogram shown in Fig. 
7B, which represents a chloromandelic acid standard. The contribution of the cell lysate to 
this peak was found to be small; it would appear that the greatest contribution to this peak 
20 was from the chloromandelonitrile, either from a breakdown product or a contaminant in the 
nitrile preparation. The peak area remained constant throughout each experiment and, using 
the appropriate controls, it was found that subtraction of the peak area from that of the 
product yielded sufficient accuracy. Many attempts were made to change the HPLC 
conditions so that the product peak eluted at a later time; however, these attempts were not 
25 successful. Chromatogram shown in Fig. 7C illustrates the appearance of product and the 
reduction of the substrate peaks. 

[000315] The chiral analysis of chloromandelic acid was almost problem-free. The elution 
of a small peak at the same time as the (5)-enantiomer presented some concern (the peak at 
2.4 min in chromatogram shown in Fig. 7D). However, once it was established that this peak 
30 was present in all the samples at the same level, including the blank control, and that it had a 
different UV spectrum to that of the chloromandelic acid peak, it was not regarded as a 
problem. Consequently, it was subtracted from the peak eluting at 2.4 min in each sample. 
The (R)-enantiomer eluted at 3 minutes. 
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G?)-phenyllactic acid 

[000316] The analysis of phenyllactic acid was initially plagued with the same problems 
discussed for phenylglycine and 2-chloromandelic acid. However, in this case, adjustment of 
the solvent concentration in the nonchiral HPLC method led to a shift in the retention time of 
5 the acid, so that it no longer coeluted with the cell lysate peak. Following this, no problems 
were encountered with either the nonchiral or chiral methods. Representative nonchiral 
chromatograms of the product (1 .9 min) and cyanohydrin substrate (3.7 min) are shown in 
Fig. 8A, while the chiral analysis of the acid is shown in Fig. 8B, with the L-enantiomer 
eluting at 2 min and the opposite enantiomer at 6 min. 

10 L-2-methvlphenvlglvcine 

[000317] The analysis of methylphenylglycine was unproblematic, although the nonchiral 
method did not provide baseline separation between a cell lysate peak and the product peak, 
as shown in the chromatogram illustrated in Fig. 9A. The amino acid standard for this 
method was provided in the final stages of the project, thus minimizing the time for method 

1 5 development. In the chromatogram shown in Fig. 9A the amino acid elutes at 0.7 min and 
the aminonitrile at 5.0 min. Sufficient separation between the two initial peaks was obtained 
to allow the calculation of approximate conversion to product. 

[000318] The chiral analysis of this compound provided good separation between the two 
enantiomers, as shown in the chromatogram illustrated in Fig. 9B. The L-enantiomer elutes • 
20 at 5 min and the D-enantiomer at 8 min. 
L-fert-leucine 

[000319] For the nonchiral analysis of /-leucine, the cell lysate presented the most serious 
problem amongst the group of products for this project. This was compounded by the low 
spectroscopic properties of the amino acid, leading to difficulty in differentiating the product 

15 peak from the cell lysate. Good separation of the individual product enantiomers was 

obtained by chiral analysis as shown in Fig. 10A. During the primary screen, a small peak 
eluted at the same time as the L-amino acid standard in certain samples (see Fig. 10B) and 
was thought to be the amino acid. However, further development of the method and the use 
of the appropriate controls established that this peak was actually a cell lysate peak. 

0 [000320] The aminonitrile eluted between the two f-leucine peaks, as shown in Fig. 10C; 
this chromatogram also shows the cell lysate peak at 4.8 min. The UV spectrum of the 
nitrile was distinct from that of the amino acid, making it easier to differentiate from the acid 
peaks. 
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L-hydroxyporleucine ( (Sy2-amino-6-hvdroxv hexanoic acid) 

[000321] The chiral analysis of (S)-2-amino-6~hydroxy hexanoic acid was consistent and 
reliable. By contrast, the nonchiral method presented many problems, primarily as a result of 
non-separation between the nitrile and the acid peaks. Towards the latter half of the project, a 
5 method was developed and used successfully for the confirmation of activities. Prior to this, 
most of the analysis was performed using the chiral method; standard curves of the products 
were run in order to quantify the reactions. A representative chromatogram of (S)-2-amino- 
6-hydroxy hexanoic acid is shown in Figure 1 1 A, with (S)-2-amino-6-hydroxy hexanoic acid 
eluting at 6 min. The aminonitrile was not detected by this method. 

1 0 [000322] Separation of the individual 2-amino-6-hydroxy hexanoic acid enantiomers is 

shown in Fig. 1 IB. The L-enantiomer elutes first, at 2 min, followed by the D-enantiomer at 
3 min. In Fig. 1 1C, an enzymatic sample is represented; the only area of slight concern is the 
negative peak preceding the elution of the L-enantiomer. However, it did not appear to 
interfere significantly with the elution of this enantiomer; method development did not 

1 5 eliminate the negative peak. 

4-methvl-D-leucine and 4-methvl-L-leucine 

[000323] For the detection of 4-methylleucine, the chiral HPLC method again proved more 
reliable. The combination of low activities, together with the low sensitivity of the method to 
the compound led to difficulties in detection using nonchiral HPLC. A 2.5 mM standard of 

20 the amino acid is shown in Fig. 12A, with a peak height of approximately 40 mAU; this was 
substantially lower than those detected for the aromatic compounds. Chromatogram in Fig. 
12B shows an enzymatic sample, in which conversion was detected using the chiral HPLC 
method; while it is not clear, it would appear that the 4-methylleucine peak elutes at 2.7 min 
and is extremely low in both peak height and area. This peak did not appear in samples 

25 which were negative by chiral HPLC analysis. 

[000324] The chiral analysis of 4-methyl-L-leucine and 4-methyl-D-leucine did not present 
any problems. The L-enantiomer eluted at 5 min and the D-enantiomer at 7 min, although 
some peak shift did occur, as a result of the sensitivity to the column, described in section (i) 
for phenylglycine. In chromatograms shown in Figs. 14C-14D, the separation of these amino 

30 acids is shown; the first sample represents an enzyme which produced both enantiomers and 
in the second sample, the enzyme preferentially hydrolyzed the L-enantiomer, with a small 
amount D-amino acid forming. 
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(»?)-cyclohexylmandelic acid 

[000325] Chromatograms of the standards for cyclohexylmandelic acid (Fig. 13 A) and the 
corresponding nitrile (Fig. 13B) are shown. The acid eluted at 1.3 min, while the 
cyanohydrin was observed at 2.5 min. The peak eluting at 2.1 min is thought to be the 
5 cyclohexylphenylketone, as shown by the elution of a ketone standard at this point. 

Example 12: An Enzyme Library Approach to Biocatalvsis: Development of aNitrilase 
Platform for Enantioselective Production of Carboxvlic Acid Derivatives 

[000326] Biocatalytic processes can offer unique advantages in transformations that are 

challenging to accomplish through conventional chemical methods (Wong, C.-H.; 

10 Whitesides, G.M. Enzymes in Synthetic Organic Chemistry; Pergamon, New York, 1994; 
Drauz, K.; Waldinann, H., Roberts, S.M. Eds. Enzyme Catalysis in Organic Synthesis; VCH: 
Weinheim, Germany, 2nd ed., 2002). Nitrilases (EC 3.5.5.1) promote the mild hydrolytic 
conversion of organonitriles directly to the corresponding carboxylic acids (Kobayashi, M.; 
Shimizu, S. FEMS Microbiol Lett. 1994, 120, 217; Bunch, A.W. In Biotechnology; Rehm, 

15 H.-J.; Reed, G.; Puhler, A.; Stadler, P., Eds.; Wiley- VCH: Weinheim, Germany, Vol. 8a, 
Chapter 6, pp 277-324; Wieser, M.; Nagasawa, T. In Stereoselective Biocatalysis; Patel, 
R.N., Ed.; Marcel Dekker: New York, 2000, Chapter 17, pp 461-486.) Fewer than fifteen 
microbially-derived nitrilases have been characterized and reported to date. (Harper, D.B. Int. 
J. Biochem. 1985, 17, 677; Levy-Schil, S.; Soubrier, F.; Crutz-Le Coq, A.M.; Faucher, D.; 

20 Crouzet, J.; Petre, D. Gene 1995, 161, 15; Yu, F. 1999, US Patent 5872000; Ress-Loschke, 
M.; Friedrich, T.; Hauer, B.; Mattes, R.; Engels, D. PCT Appl. WO 00/23577, April 2000.). 
Several nitrilases previously have been explored for the preparation of single-enantiomer 
carboxylic acids, although little progress has been made in the development of nitrilases as 
viable synthetic tools. This application describes the discovery of a large and diverse set of 

25 nitrilases and herein demonstrate the utility of this nitrilase library for identifying enzymes 
that catalyze efficient enantioselective production of valuable hydroxy carboxylic acid 
derivatives. 

[000327] In an effort to access the most diversified range of enzymes that can be found 
in Nature, we create large genomic libraries by extracting DNA directly from environmental 
\0 samples that have been collected from varying global habitats. (For a description of these 
methods, see: Short, J.M. Nature Biotech. 1997, 15, 1322; Handelsman, J.; Rondon, M.J.; 
Brady, S.F.; Clardy, J.; Goodman, R.M. Chem. Biol 1998, 5, R245; Henne, A.; Daniel, R.; 
Schmitz, R.A.; Gottschalk, G. Appl Environ. Microbiol 1999, 65, 3901.). We have 
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established a variety of methods for identifying novel activities through screening mixed 
populations of uncultured DNA. (Robertson, D.E.; Mathur, E.J.; Swanson, R.V.; Mairs, 
B.L.; Short, J.M. SIM News 1996, 46, 3; Short, J.M. US Patent 5,958,672, 1999; Short J.M. 
US Patent 6,030,779, 2000.) Through this approach, nearly 200 new nitrilases have been 
discovered and characterized. (For a concise description of the studies, see Materials and 
Methods section below.) All nitrilases were defined as unique at the sequence level and were 
shown to possess the conserved catalytic triad Glu-Lys-Cys which is characteristic for this 
enzyme class. (Pace, EL; Brenner, C. Genome Biology 2001, 2, 0001.1-0001.9.) Each 
nitrilase in our library was overexpressed and stored as a lyophilized cell lysate in order to 
facilitate rapid evaluation of the library for particular biocatalytic functions. 
[000328] The initial investigations focused upon the efficacy of nitrilases for production of 
a-hydroxy acids 2 formed through hydrolysis of cyanohydrins 1 . Cyanohydrins are well- 
documented to racemize readily under basic conditions through reversible loss of HCN. 
(Inagaki, M.; Hiratake, J.; Nishioka, T.; Oda, I; J. Org. Chem 1992, 57, 5643. (b) van 
Eikeren, P. US Patent 5,241,087, 1993.) Thus, a dynamic kinetic resolution process is 
possible whereby an enzyme selectively hydrolyzes only one enantiomer of 1, affording 2 in 
100% theoretical yield and with high levels of enantiomeric purity. 
[000329] One important application of this type involves commercial production of (R)- 
mandelic acid from mandelonitrile. (Ress-Loschke, M.; Friedrich, T.; Hauer, B.; Mattes, R.; 
Engels, D. PCT Appl. WO 00/23577, April 2000; Yamamoto, K.; Oishi, K.; Fujimatsu, L; 
Komatsu, K. Appl Environ. Microbiol 1991, 57, 3028; Endo, T.; Tamura, K. US Patent 
5,296,373, March 1994.) Mandelic acid and derivatives find broad use as intermediates and 
resolving agents for production of many pharmaceutical and agricultural products. (Coppola, 
G.M.; Schuster, H.F. Chiral oc-Hydroxy Acids in Enantioselective Synthesis; Wiley- VCH: 
Weinheim, Germany: 1997.) However, the few known nitrilases derived from cultured 
organisms have not been found useful for efficient and selective hydrolysis of analogous 
substrates. 

O OH OH OH 

A + HCN ^ ^ + I Nitrilas f I 
R H R CN R"XN R^C0 2 H 

(S)-1 (R)-1 (S)-2 

[000330] The nitrilase library was screened for activity and enantioselectivity in the 
hydrolysis of mandelonitrile (3a, Ar = phenyl) to mandelic acid. Preliminary results revealed 
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that 27 enzymes afforded mandelic acid in >90% ee. One enzyme, SEQ ID NOS:385, 386, 
was studied in greater detail and was found to be very active for hydrolysis of mandelonitrile. 
Under standard conditions using 25 mM 3a and 0.12 mg/mL enzyme in 10% MeOH (v/v) 0.1 
M phosphate buffer at 37oC and pH 8, (J?)-mandelic acid was formed quantitatively within 
10 min and with 98% ee. To confirm synthetic utility, the reaction was performed using 1.0 g 
3a (50 mM) and 9 mg nitrilase (0.06 mg/mL nitrilase I); after 3 h (R)-mandelic acid was 
isolated in high yield (0.93 g, 86%) and again with 98% ee. 

(a) Reactions were conducted under standard conditions (see text). Reaction time for 
complete conversion to 4 was 1-3 h. Entries 8-9 were conducted at pH 9 and 5 mM substrate 
concentration, (b) Specific activities were measured at 5 min transformation timepoints and 
are expressed as jimol mg* 1 min \ (c) TOF = turnover frequency, mol product/mol 
catalyst/sec. (d) Enantioselectivites were determined by chiral HPLC analysis. Hydroxy acids 
were isolated and absolute configurations were determined to be (R) in all cases. 

[000331] the substrate scope of SEQ ID NOS:385, 386 was next explored. As shown 
in Table 13, a broad range of mandelic acid derivatives as well as aromatic and 
heteroaromatic analogues (4) may be prepared through this method. SEQ ID NOS:385, 386 
tolerates aromatic ring substituents in the ortho-, meta^ and jcwra-positions of mandelonitrile 
derivatives and products of type 4 were produced with high enantioselectivities. Other larger 
aromatic groups such as 1-naphthyl and 2-naphthyl also are accommodated within the active 
site, again affording the acids 4 with high selectivity (Table 13, entries 8-9). Finally, 3- 
pyridyl and 3-thienyl analogues of mandelic acid were prepared readily using this process 
(Table 13, entries 10-1 1). This is the first reported demonstration of anitrilase that affords a 
range of mandelic acid derivatives and heteroaromatic analogues of type 4. High activity on 
the more sterically encumbered orrAo-substituted and 1-naphthyl derivatives is particularly 
noteworthy. 
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O H SEQ ID NOS: O H 

Ar 1 103,104 Ar - 

5 (S)-6 

[000332] We next examined the preparation of aryllactic acid derivatives 6 through 
hydrolysis of the corresponding cyanohydrins 5. Phenyllactic acid and derivatives serve as 
versatile building blocks for the preparation of numerous biologically active compounds. 
(Coppola, G.M.; Schuster, H.F. Chiral ct-Hydroxy Acids in Enantioselective Synthesis; 
5 Wiley-VCH: Weinheim, Germany: 1997.) Upon screening our nitrilase library against the 
parent cyanohydrin 5a (Ar = phenyl), we found several enzymes that provided 6a with high 
enantiomeric excess. One enzyme, SEQ ID NOS: 103, 104, was further characterized. After 
optimization, SEQ ID NOS: 103, 104, was shown to provide (5)-phenyllactic acid (6a) with 
complete conversion (50 mM) and very high enantioselectivity (98% ee) over 6 h. The 
10 highest enantioselectivity previously reported for biocatalytic conversion of 5 to 6 was 75% 
ee achieved through a whole cell transformation using a Pseudomonas strain. (Hashimoto, 
Y.; Kobayashi, E.; Endo, T.; Nishiyama, M.; Horinouchi, S. Bioscl Biotech. Biochem, 1996, 
60, 1279.) 

Table 7. Nitrilase H-catalyzed production of aryllactic acid derivatives and analogues 6° 
Entry Arin6 Spec. Act. 6 TOP %e? 

1 CeRs 25 16 99 

2 2-Me-C6H 5 160 100 95 

3 2-Br-C6H 5 121 76 95 

4 2-F-C6H5 155 97 91 

5 3-Me-C6H 5 21 13 95 

6 3-F-C 6 H 5 22 14 99 
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7 1-naphthyl 64 40 96 

8 2-pyridyl 10.5 6.6 99 

9 3-pyridyl 11.6 7.2 97 

10 2-thienyl 3.4 2.1 96 

11 3-thienyl 2.3 1.4 97 



(a) Reaction conditions as in Table 13, except 0.016 mg/mL nitrilase was used. Full 
conversion to 6 was observed within 6 h. (b)-(d) See Table 13. The absolute configuration 
was determined to be (S) for phenyllactic acid and entries 2-1 1 were assigned (S) based upon 
identical chiral HPLC peak elution order. 

5 [000333] Ortho and meta substituents appear to be tolerated well by nitrilase II, with 
ortho substituted derivatives surprisingly being converted with higher rates relative to the 
parent substrate 5a. Novel heteroaromatic derivatives, such as 2-pyridyl-, 3-pyridyl, 2- 
thienyl- and 3-thienyllactic acids, were prepared with high conversions and 
enantioselectivities (entries 8-11). Unexpectedly, para substituents greatly lowered the rates 
1 0 of these reactions, with full conversion taking over two weeks under these conditions. 

[000334] The final transformation that we examined was desymmetrization of the 
readily available prochiral substrate 3-hydroxyglutarylnitrile (7) (Johnson, F.; Panella, J.P.; 
Carlson, A.A. J. Org. Chem. 1962, 27, 2241) to afford hydroxy acid (i?)-8 which, once 
esterified to (J?)-9, is an intermediate used in the manufacture of the cholesterol-lowering 

1 5 drug LIPITOR™. Previously reported attempts to use enzymes for this process were 

unsuccessful and 8 was produced with low selectivity (highest: 22% ee) and the undesired 
(^-configuration. (Crosby, J.A.; Parratt, J.S.; Turner, N.J. Tetrahedron: Asymmetry 1992, 3, 
1547; Beard, T.; Cohen, M.A.; Parratt, J.S.; Turner, N.J. Tetrahedron: Asymmetry 1993, 4, 
1085; Kakeya, H.; Sakai, N.; Sano, A.; Yokoyama, M.; Sugai, T.; Ohta, H. Chem. Lett 1991, 

20 1823.) 
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[000335] 



The nitrilase library was screened and unique enzymes were discovered and 



isolated that provided the required product (if)-8 with high conversion (>95%) and >90% ee. 
Using one of the (i?)-specific nitrilases, this process was operated on a 1.0 g scale (240 mM 7, 
30 mg enzyme, 22°C, pH 7) and after 22 h, (R}-8 was isolated in 98% yield and 95% ee. 
Interestingly, the same screening program also identified nitrilases that afford the opposite 
enantiomer (5)-8 with 90-98% ee. Thus, the extensive screen of biodiversity has uncovered 
enzymes that provide ready access to either enantiomer of the intermediate 8 with high 
enantioselectivities. Our discovery of the first enzymes that furnish (R)-S underscores the 
advantage of having access to a large and diverse library of nitrilases. 

[000336] By plumbing our environmental genomic libraries created from uncultured 
DNA, we have discovered a large array of novel nitrilases. This study has revealed specific 
nitrilases that furnish mandelic and aryllactic acid derivatives, as well as either enantiomer of ', 
4-cyano-3-hydroxybutyric acid in high yield and enantiomeric excess. 

Procedures and Analytical Data: 

[000337] Hydroxyglutarylnitrile was purchased from TCI America and used as 
received. Amino acids used for the preparation of aryl lactic acid standards were purchased 
from PepTech (Cambridge, MA). (i?)-3-hydroxy-4-cyanobutyric acid was obtained from 
Gateway Chemical Technology (St. Louis, MO). Both (R)- and (5)- mandelic acid and (R)- 
and (S)- phenyl lactic acid standards were purchased from Sigma Aldrich. All other reagents 
were purchased from Sigma Aldrich and utilized without further purification. Silica Gel, 70- 
230 mesh, 60 A, purchased from Aldrich, was used for chromatographic purifications. All J H 
NMRs and I3 C NMRs were run on Bruker model AM-500 machines, set at room temperature, 
500 MHz and 125MHz respectively for l U and 13 C. Mass analyses and unit mass resolution 
was achieved by flow injection analysis (FIA) using a Perkin-Elmer Sciex API-4000 
TURBOION™ Spray LC/MS/MS system. The LC flow was provided by Schimadzu LC- 
lOAdvp pumps, with 0.05% acetic acid and MeOH. Injections were accomplished via a 
Valco injector valve. The HPLC analysis was done on an Agilent 1 100 HPLC with Astec's 
Chirobiotic R column (100 x 4.6 mm, cat no. 13022 or 150 x 4.6 mm, cat no. 13023) or 
Daicel's Chiralcel OD column (50 x 4.6 mm, cat no. 14022) and the DAD detector set at 210, 
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220, 230, and 250 nm. For specific rotations, a Perkin Elmer Model 341 Polarimeter was 
used, set at 589 nm, Na lamp, at room temperature, with a 100 mm path length cell. 
Concentrations for specific rotation are reported in grams per 100 mL of solvent. 
Microbiology techniques were executed in accordance to published protocols. (Sambrook, J. 
5 Fritsch, EF, Maniatis, T. (1989) Molecular Cloning: A Laboratory Manual (2nd ed.), Cold 
Spring Harbor Laboratory Press, Plainview NY.) Glycolic acid products were isolated and 
absolute configurations were determined to be (R) in all cases by comparison with literature 
optical rotation data on configurationally defined compounds except for (-)-3-pyridylglycolic 
acid, which to our knowledge is not known as a single enantiomer. (For mandelic, 2- 

10 chloromandelic, 2-methyl mandelic, 3-chloromandelic, 3-bromomandelic and 4- 

fluoromandelic acid see Hoover, J.R. E.; Dunn, G. L.; Jakas, DJR.; Lam, L.L.; Taggart, J. J.; 
Guarini, J.R.; Phillips, L. J. Med. Chem. 1974, 17(1), 34-41; For 2-bromo mandelic acid see 
Collet, A.; Jacques, J.; Bull Soc. Chem. Fr. 1973, 12, 3330-3331; For 1- and 2- 
napthylglycolic acid see Takahashi, I; Y. Aoyagi, I. Nakamura, Kitagawa, A., Matsumoto, 

15 K., Kitajima, H. Isa, K. Odashima, K. Koga, K. Heterocycles 1999, 51(6), 1371-88; For 3- 
thienylglycolic acid Gronowitz, S.Ark Kerni, 1957, 11, 519-525.) 

[000338] For the aryl lactic acid products, absolute configuration was established to be 
(S) for phenyl lactic acid by comparison with literature optical rotation and for all other 
phenyl lactic acid products, absolute configurations were predicted based upon elution order 
20 using chiral HPLC. Absolute configuration for 3-hydroxy-4-cyano-butanoic acid was 
established by derivatization to (i?)-(-)-Methyl (3-0-[benzoyl]-4-cyano)-butanoate and 
comparison to literature optical rotation data on configurationally defined compound. (3. 
Beard, T. Cohen, M. A. Parratt, J.S. Turner, N. J. TetrahedroniAsymm. 4(6), 1993, 1085- 
1104.) 

25 Nitrilase Discovery and Characterization Methods: 
1. Nitrilase Selection. 

[000339] An Escherichia coli screening host strain, SEL700, was optimized for nitrilase 
selections on a nitrile substrate. An Abs 6 oonm = 1, resuspension of SEL700 screening host in 
10 mM MgS0 4 was infected with kanamycin-resistant environmental DNA library for 45 
30 minutes at 37°C, such that complete screening coverage of the library was achieved. Infected 
cells, now denoted by kanamycin resistance, were plated on kanamycin LB plates and 
allowed to grow overnight at 30°C. Titer plates were also made to determine infection 
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efficiency. Cells were pooled, washed, and resuspended the next morning with 10 mM 
MgS0 4 . Transformed clones were inoculated into M9 media (without nitrogen) with 10 mM 
of nitrile substrate. M9 media consisted of IX M9 salts (NH 4 C1 omitted), O.lmM CaCl 2 , 1 
mM MgS0 4 , 0.2 % glucose, and approximately 10 mM of a nitrile selection substrate. The 
5 selection cultures were then incubated at 30°C, shaking at 200 rpm, for up to five weeks. 
Positive nitrilase cultures were identified by growth, due to positive clone's ability to 
hydrolyze nitrile substrate. Positive clones were isolated by streaking out a selection culture 
with growth and subsequent secondary culturing of isolated colonies in the same defined 
media. The DNA from any positive secondary cultures exhibiting re-growth was then 
10 isolated and sequenced to confirm discovery of a nitrilase gene and to establish the unique 
nature of that gene. 
2. Nitrilase Biopanning. 

[000340] Traditional filter lift hybridization screening protocols are limited to libraries 
with approximately 10 6 to 10 7 members. Attempting to screen one library would require 

15 approximately 5,000 filter lifts. Therefore, solution phase and other biopanning formats have 
been developed for ultra high throughput sequence based screening permitting rapid 
screening of up to 10 8 member environmental libraries In the solution format, the DNA 
from a large number of library clones is mixed with tagged molecules of interest under 
conditions which promote hybridization. The tagged clones and hybridized DNA are then 

20 removed from solution and washed at some level of stringency to remove clones which do 
not have sequence identity with the probe. The hybridized DNA is then eluted and recovered. 
Clones of interest are sequenced and cloned to provide enzyme activities of interest. This 
method has been demonstrated to achieve up to 1,000-fold enrichment per round for 
sequences of interest. 

25 3. High Throughput Nitrilase Activity Assay. 

[000341] Activity assays were conducted using 25 mM (-3 mg/mL) substrate, 0. 1 
mg/mL nitrilase in 0.25 mL of assay solution. Assay solutions consisted of 0-10% (v/v) 
MeOH in 0.1 M sodium phosphate buffer solution at pH 7 to 9 and temperatures 37°C or 
22°C. Specific activities were measured at 5 min transformation time point, unless otherwise 

30 noted, and are expressed in units ^mol mg" 1 min" 1 . Enantiomeric excess and conversion rates 
were determined by high throughput HPLC analysis comparing enzyme product 
concentration to standard curves of racemic acid products. Analytical conditions for the 
products are tabulated below. 
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Analytical Methods: 





Acid Product 


Column 


Liquid Chromatography Method 


Retention Times of 
enantiomers (min) 


1.1 


mandelic acid 


Chirabiotic R 
100 x 4.6 mm 


20%[0.5% AcOH], 80% CH 3 CN 
1 ml/min 


2.4 (5); 2.9 (*) 


1.2 


2-Cl-mandelic 
acid 


Chirabiotic R 
100x4.6 mm 


20%[0.5% AcOH], 80% CH 3 CN 
1 ml/min 


2.3 (5); 2.9 (R) 


1.3 


2-Br-mandelic 
acid 


Chirabiotic R 
100x4.6 mm 


20%[0.5% AcOH], 80% CH 3 CN 
1 iriVmin 


2.8; 4.0 


1.4 


2-CH 3 -mandelic 
acid 


Chirabiotic R 
100x4.6 mm 


20%[0.5% AcOH], 80% CH 3 CN 
1 rxil/min 


3.1; 3.8 


1.5 


3-Cl-mandelic 


Chirabiotic R 
100x4.6 mm 


10%[0.5% AcOH], 90% CH 3 CN 
1 ml/min 


3.1; 3.8 


1.6 


3-Br-mandelic 


Chirabiotic R 
100x4.6 mm 


10%[0.5% AcOH], 90% CH 3 CN 
1 ml/min 


3.3; 3.9 


1.7 


4-F-mandelic 


Chirabiotic R 
150x4.6 mm 


20%[0.5% AcOH], 80% CH 3 CN 
1 rnl/min 


3.7; 4.8 


1.8 


1-napthylglycolic 
acid 


Chirabiotic R 
100 x 4.6 mm 


4%[0.5% AcOH], 96% CH 3 CN 
1 ml/min 


3.1; 3.7 


1.9 


2-napthylglycolic 
acid 


Chirabiotic R 
100 x 4.6 mm 


4%[0.5% AcOH], 96% CH 3 CN 
1 ml/min 


3.7; 4.7 


1.10 


3-pyridylglycolic 
acid 


Chirabiotic R 
100x4.6 mm 


5% [0.5% AcOH], 65% H 2 0, 
30% CH 3 CN, 2 rmVnrin 


4.4; 5.5 


1.11 


3-thienylglycolic 
acid 


Chirabiotic R 
100x4.6 mm 


20%[0.5% AcOH], 80% CH 3 CN 
2 ml/min 


1.4; 2.5 


2.1 


phenyl lactic acid 


Chirabiotic R 
150 x 4.6 mm 


20%[0.5% AcOH], 80% CH 3 CN 
1 ral/min 


2.8(5): 4.0 (R) 


2.2 


2-methylphenyl 
lactic acid 


Chirabiotic R 
150x4.6 mm 


20%[0.5% AcOH], 80% CH 3 CN 
1 ml/min 


2.5; 2.8 


2.3 


2-bromophenyl 
lactic acid 


Chirabiotic R 
150x4.6 mm 


20%[0.5% AcOH], 80% CH 3 CN 
1 ml/min 


2.8; 3.2 


2.4 


2-fluorophenyl 
lactic acid 


Chirabiotic R 
150x4.6 mm 


20%[0.5% AcOH], 80% CH 3 CN 
1 ml/min 


2.6; 2.9 


2.5 


3-methylphenyl 
lactic acid 


Chirabiotic R 
150x4.6 mm 


20%[0.5% AcOH], 80% CH 3 CN 
1 ml/min 


2.4; 3.2 


2.6 


3-fluorophenyl 
lactic acid 


Chirabiotic R 
150 x 4.6mm 


20%[0.5% AcOH], 80% CH 3 CN 
1 ml/min 


2.8; 3.6 


2.7 


1-napthyllactic 
acid 


Chirabiotic R 
1 50 x 4.6mm 


20%[0.5% AcOH], 80% CH 3 CN 
1 ml/min 


2.7; 3.1 


2.8 


2-pyridyllactic 
acid 


Chirabiotic R 
1 50 x 4.6mm 


20%[0.5% AcOH], 80% CH 3 CN 
1 ml/min 


2.5; 2.9 


2.9 


3-pyridyllactic 
acid 


Chirabiotic R 
150 x 4.6mm 


20%[0.5% AcOH], 80% CH 3 CN 
1 ml/min 


2.9; 3.6 


2.10 


2-thienyllactic 
acid 


Oiirabiotic R 
1 50 x 4.6mm 


20%[0.5% AcOH], 80% CH 3 CN 
1 ml/min 


3.6; 4.6 


2.11 


3-thienyllactic 
acid 


Chirabiotic R 
150 x 4.6mm 


20%[0.5% AcOH], 80% CH 3 CN 
1 ml/min 


3.5; 4.6 




Methyl(3-0 
[benzoyl]-4- 
cyano)-butanoate 


Daicel OD 
50x4.6 mm 


5% isopropanol, 95% hexane 
1 ml/min 


4.5 (R); 5.4(5) 



Cvanohvdrin (Substrate) Synthesis: 
5 [000342] Mandelonitrile Synthesis Method A: Acetone cyanohydrin (685 nL, 7.5 mmol), 
aldehyde (5 mmol), and catalytic DIEA (13 ^L, 0.075 mmol) were mixed at 0 °C. The 
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reactions were stirred on ice for 45 minutes. To drive the equilibrium toward the product, 
acetone was removed in vacuo. Subsequently, crude reactions were acidified with H2SO4 (3 
HL) and stored at -20°C. TLC was used to monitor reaction progress (3:1 hexane/ethylacetate 
(EtOAc). 

5 [000343J Mandelonitrile Synthesis Method B: To a solution of KCN (358 mg, 5.5 mmol) in 
MeOH (1 mL) at 0°C was added aldehyde (5 mmol) and acetic acid (315 ^iL, 5.5 mmol). 
After stirring for one hour on ice, MeOH was removed in vacuo^ and the crude mixture was 
partitioned using EtOAc and H2O. The organic fraction was retained and concentrated in 
vacuo. TLC analysis was used to monitor reaction progress (3:1 Hexanes/EtOAc). 

10 [000344] Aryl Acetaldehyde Cyanohydrin:, Arylacetic acid (50 mmol) was dissolved in 50 
ml anhydrous tetrahydrofuran (THF) in a two-neck 500 ml round-bottom flask under N2(g) 
atmosphere. To this solution cooled to 0 °C, under vigorous mixing, was added slowly 105 
mmol of thexylchloroborane-dimethyl sulfide (2.55 M in methylene chloride). The reaction 
was allowed to proceed overnight. Excess acetic acid (10 ml) was added to quench and 

15 acidify the reaction followed by the addition 10 ml water. After stirring at room temperature 
for 1 hour, solvent was removed in vacuo and the residue was dissolved in 100 ml water and 
extracted with 200 ml EtOAc. The EtOAc layer was dried over sodium sulfate, filtered and 
then concentrated in vacuo. Subsequently, 60 mmol of KCN, followed by 100 ml methanol 
was added to the residue. The solution was then cooled to 0 °C and acetic acid (60 mmol) 

20 added. The reaction was stirred for 1-2 hours after all KCN dissolved. Solvents were removed 
in vacuo and residue was dissolved in 100 ml water and 200 ml EtOAc. The aqueous layer 
was extracted with EtOAc one more time. Combined EtOAc extracts were washed with 
saturated brine and dried over sodium sulfate, filtered and then concentrated in vacuo to 
obtain crude cyanohydrin product. The cyanohydrin was purified by silica-gel column 

25 (hexane/EtOAc), as necessary. 

[000345] 2-chloro mandelonitrile: *H NMR (CDC1 3 , 500 MHz) 5 7.69 (m, 1H), 7.41 (m, 
1H), 7.36 (m, 2H), 5.84 (s, 1H), 3.07 (br, 1H). 13 C NMR (CDCI3, 125 MHz) 5 132.89, 
132.73, 131.22, 130.19, 128.48, 127.84, 118.24, 60.87. MS calc'd for [C 8 H 6 C1N0] 167.01 
found 167.9 (LC-MS+). 

30 [000346] 2-bromomandelonitrile: *H NMR (CDCI3, 500 MHz) 8 7.72 (d, 1H, J~ 6.58), 7.62 
(d, 1H, J= 8.35), 7.43 (t, 1H, J= 8.42), 7.30 (t, 1H, J= 7.00), 5.85 (s, 1H). 13 C NMR (CDCI3, 
125 MHz) 8 134.550, 133.584, 131.564, 128.819, 128.535, 122.565, 118.153, 63.379. 
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[000347] 2-methylmandelonitrile: 'H NMR (CDC1 3 , 500 MHz) 8: 7.60 (d, 1H, J = 7.4), 
7.23-7.35 (m, 3H), 5.66 (s, 1H), 2.44 (s, 3H). ,3 C NMR (CDC1 3 , 298 K, 125 MHz) 8: 
136.425, 133.415, 131.450, 130.147,127.204, 126.894, 118.952, 18.916. MScalc'dfor 
[C9H9NO] 147.07, found 147.2 (ESI+). 
5 [000348] 3-chloromandelonitrile: 'H NMR (CDC1 3> 500 MHz) 5 7.55 (s, 1H), 7.43-7.37 (m, 
3H), 5.54 (s, 1H). 13 C NMR (CDCI3, 125 MHz) 8 137.183, 135.480, 130.718, 
130.303,127.047, 124.891, 118.395, 63.156. MS calc'd for [CuHeCINO] 167.01 found 167.9 
(LC-MS +). 

[000349] 3-bromomandelonitrile: ! H NMR (CDC1 3> 500 MHz) 8 7.69 (s, 1H), 7.56 (d, 7= 
10 6.2 Hz, 1H), 7.45 (d, 7= 5.5Hz, 1H), 7.32 (t, 7= 6.4. Hz, 1H), 5.53 (s, 1H). I3 C NMR (CDCI3, 
125 MHz) 8 137.376, 133.201, 130.934, 129.208,125.359, 123.380, 118.458, 63.006. MS 
calc'd for [CgHeBrNO] 212.0 found 211.9 (LC-MS +). 

[000350] 4-fluoromandelonitrile: *H NMR (CDC1 3 , 500 MHz) 8 5.54 (s, 1H), 7.13 (m, 2H), 
7.51-7.53 (m, 2H). 13 CNMR(CDa 3 , 125 MHz) 8 63.02, 116.44, 118.97, 128.90, 131.54, 
15 132.51, 162.575. 

[000351] 4-chloromandelonitrile: 'H NMR (CDC1 3 , 500 MHz) 8 7.47 (d, 7= 7.0 Hz , 2H), 
7.42 (d, 7= 7.0 Hz , 2H), 5.53 (s , 1H). I3 C NMR (CDCI3, 125 MHz) 8 136.209, 133.845, 
129.647, 128.232, 118.630, 63.154. MS calc'd for [C 8 H6C1N0] 167.01 found 167.9 (LC-MS 
+) 

20 [000352] l-naphthylcyanohydrin: 1 HNMR(CDCl 3 ,500MHz)8 8.14(d, lH,y=8.5), 
7.92 (t, 2H, 7= 6.1), 7.82 (d, 1H, 7= 5.7), 7.62 (t, 1H, 7= 6.1), 7.56 (t, 1H, 7= 6.1), 7.50 (t, 
1H,7=6.1), 6.18 (s, 1H); 13 CNMR(CDC1 3 , 125 MHz) 8 137.0, 135.7, 134.2, 131.1, 129.2, 
127.5, 126.7, 125.8, 125.3, 123.1, 119.0, 62.4; MS calc'd for [C12H9O] 183.21, found 183.2 
(ESI+). 

25 [000353] 2-naphthyl cyanohydrin: 'H NMR (CDC1 3 , 500 MHz) 8 8.03 (s, 1H), 7.92 (d, 1H, 
7= 8.6), 7.87-7.91 (m, 2H), 7.61 (dd, 1H, 7= 6.7, 1.2), 7.55-7.60 (m, 2H), 5.72 (s, 1H); 13 C 
NMR(CDC1 3 , 125 MHz) 8 134.9, 133.9, 132.7, 129.6, 128.6, 128.0, 127.4, 127.2, 126.4, 
123.9, 118.9, 64.1; MS calc'd for [C12H9O] 183.21, found 183.2 (ESI+). 
[000354] 3-pyridyl cyanohydrin: 'H NMR (CDCI3, 500 MHz) 8: 8.62 (d, 1H, 7= 1.8), 8.57 

30 (d, 1H,7= 5.1), 7.94 (d, 1H, 7= 8.1), 7.41 (dd, 1H, 7= 8.1, 5.1), 5.64 (s, 1H); ,3 C NMR 
(CDC1 3 , 125 MHz) 8 149.921, 147.355, 135.412, 133.044, 124.443, 118.980, 61.085. MS 
calc'd for [C 7 H 6 N 2 0] 134.05, found 135.2 (ESI +). 
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[000355] 3-thienyl cyanohydrin: ] H NMR (CDC1 3 , 500 MHz) 8 7.45 (d, J= 2.2 Hz 1H), 
7.56 (dd, J= 6.2 Hz, 1H), 7.45 (d, J= 5.5Hz, 1H), 7.32 (t,./= 6.4. Hz, 1H), 5.53 (s, 1H). ,3 C 
NMR (CDCI3, 125 MHz) 5 137.376, 133.201, 130.934, 129.208,125.359, 123.380, 118.458, 
63.006. MS calc'd for [CeHsNOS] 139.01 found 139.9 (LC-MS +). 
5 [000356] phenyl acetaldehyde cyanohydrin: l H NMR (CDC1 3 , 500 MHz) § 7.34 (m, 5H), 
4.64 (t, J= 6.75 Hz, 1H), 3.1 1 (d, J= 6.75 Hz, 2H), 2.75 (br, 1H), 13 C NMR (CDCI3, 125 
MHz) 8 133.96, 129.91, 129.16, 128.08, 119.47, 62.33, 41.55. 

[000357] 2-methylphenyl acetaldehyde cyanohydrin: *H NMR (CDCI3, 500 MHz) § 7.11 
(m, 4H), 4.61 (t, J= 6.62 Hz, 1H), 3.12 (d, J= 6.62 Hz, 2H), 2.14(s, 3H), 13 C NMR (CDCI3, 
10 125 MHz) 8 136.94, 136.47, 132.57, 130.48, 127.61, 125.75, 120.1 1, 62.95, 44.73 MS calc'd 
for [CioHnNO]: 161.08, found 162.2 (M+Na, ESI +) 

[000358] 2-bromophenyl acetaldehyde cyanohydrin: 'H NMR (CDC1 3 , 500 MHz) 5 7.20 
(m, 4H), 4.78 (t, /= 6.5 Hz, 1H), 3.26 (d, J= 6.5 Hz, 2H). 13 C NMR (CDC1 3 , 100 MHz) 8 
133.93, 132.82, 131.72, 129.21, 128.12, 124.86, 119.41, 63.02, 44.89. 
1 5 [000359] 2-fluorophenyl acetaldehyde cyanohydrin: ! H NMR (CDC1 3 , 500 MHz) 8 7.2 (m, 
2H), 7.02 (m, 2H), 4.50 (dd, J= 4.62 Hz, /= 7.88 Hz, IE), 3.23(dd, 7= 4.62 Hz, 1 J= 14.12 
Hz, 1H), 2.97 (dd, 7.88 Hz, 14.12 Hz, 1H). 13 C NMR (CDC1 3 , 125 MHz) S 132.18, 131.52, 
129.66, 129.03, 128.07, 124.05, 1 15.8, 63.02, 44.79 MS calc'd for [CsHsFNO] 165.06, found 
164.2 (ESI+). 

20 [000360] 3-methylphenyl acetaldehyde cyanohydrin: ! H NMR (CDC1 3 , 500 MHz) 8 7.18 
(m, 1H), 7.02 (m, 3H), 4.54 (dd, J = 4.62 Hz, /= 8 Hz, 1H), 3.06 (dd, J= 4.62 Hz, J= 14.38 
Hz, 1H), 2.83(dd, J= 8 Hz, J= 14.38 Hz, 1H), 2.36 (s, 3H) 13 C NMR (CDCI3, 125 MHz) 
8 176.25, 138.18, 136.0, 130.97, 128.93, 127.68, 126.58, 76.42, 34.29, 37.69 MS calc'd for 
[CioH, 2 0 3 ] 180.08, found 180.0 (ESI +). 

25 [000361] 3-fluorophenyl acetaldehyde cyanohydrin: 'H NMR (CDCI3, 500 MHz) 8 7.1 8 
(m, 2H), 6.95 (m, 2H), 4.44 (dd, 1H), 3.11(dd, 1H). 13 CNMR (CDCI3, 125 MHz) 8 130.40, 
125.53, 124.85, 116.92, 114.87, 114.50, 119.77, 61.97,41.27. 

[000362] 1-napthyl acetaldehyde cyanohydrin: J H NMR (CDCI3, 500 MHz) 8 8.07(m, 1H), 
7.86(m, 1H), 7.74(m, 1H), 7.41(m, 4H),4.20 (t, /= 7 Hz, 1H), 3.33 (d, J= 6.8 Hz, 2H) ,3 C 
30 NMR (CDCI3, 125 MHz) 8 177.7, 140.31, 129.74, 129.24, 128.92, 128.26, 127.84, 125.63, 
124.53, 124.05, 123.42, 70.58, 38.0 MS calc'd for [CisHnNO] 197.08, found 197.1 (ESI +). 
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[000363] 2-pyridyl acetaldehyde cyanohydrin: *H NMR (CDC1 3 , 500 MHz) 8 8.50 (m, 1H), 
7.85 (m, 1H), 7.48 (m, 1H), 7.34 (m, 1H), 4.42 (m, 1H), 3.19 (dd, J= 3.5 Hz, 7= 13.7 Hz, 
2H). 13 CNMR(CDC1 3 , 125 MHz) 8 157.44, 145.69, 140.24, 126.96, 126.16, 122.99, 60.30, 
42.60 MS calc'd for [CsHsW] 148.06, found 149.1 (ESI +). 
5 [000364] 3-pyridyl acetaldehyde cyanohydrin: *H NMR (CDCl 3s 500 MHz) 8 8.62 (d, 1H, J 
= 1.8), 8.57 (d, 1H,J= 5.1), 7.94 (d, 1H, J= 8.1), 7.41 (dd, 1H, J= 8.1, 5.1), 5.64 (s, 1H). 
13 CNMR(CDC1 3 , 125 MHz) 8: 149.921, 147.355, 135.412, 133.044, 124.443, 118.980, 
61.085. Exact Mass calculated for [CyHtffeO]: 134.05, found: 135.2 (ESI +). 
[000365] 2-thienyl acetaldehyde cyanohydrin:^ NMR (CDC1 3 , 500 MHz) 8 7.1 (m, 1H), 

10 6.9 (m, 1H), 6.8 (m, 1H), 4.11 (t, 7.0Hz, 1H), 2.86 (d, J= 7.0Hz, 2H). 13 C NMR (CDC1 3 , 
125 MHz) 8 127.68, 127.41, 125.58, 124.60, 118.70, 63.25, 44.84. 
[000366] 3-thienyl acetaldehyde cyanohydrin: *H NMR (CDC1 3 , 500 MHz) 8 7.09 (m, 
3H), 4.60 (t, J= 6.25Hz, 1H), 3.12 (d, J= 6.25Hz, 2H). l3 C NMR (CDCI3, 125 MHz) 
8 129.05, 127.16, 125.27, 122.65, 119.87, 61.58,44.90. 

15 [000367] Preparation of racemic mandelic acids standards from corresponding 

cyanohydrins: (Stoughton, R.W. J. Am. Chem. Soc. 1941, 63, 2376) 2-bromomandelonitrile.. 
(230 mg, 1 .08 mmol) was dissolved in cone. HC1 (1 mL) and stirred at room temperature for. 
1 8 h and then at 70 °C for 24 h. After cooling, the reaction mixture was extracted with diethyl 
ether (4x2 mL). Organic extracts were, combined, dried over MgS0 4 , filtered and 

20 concentrated in vacuo. 2-bromomandelic acid was isolated as a colorless powder (180 mg, 
0.78 mmol, 70 % yield). 

[000368] Preparation of racemic aryllactic acids standards from corresponding amino acids: 
Phenylalanine (10 mmol, 1.65g) was dissolved in 30 ml 2N H 2 S0 4 at room temperature under 
N 2 (g) atmosphere. Sodium nitrite (1 .4 g in 3 ml aqueous solution, 2 eq) solution was added 

Z5 slowly to the reaction mixture over a period of 3-4 hours with vigorous stirring at room 
temperature under N2 (g) atmosphere. The reaction mixture was stirred overnight and the 
phenyllactic acid product was then extracted into diethylether (3 x 30 ml). Combined ether 
extracts were dried over MgS0 4 and then filtered and concentrated in vacuo. (Kenji, L; 
Susumu, A.; Masaru, M.; Yasuyoshi, U.; Koki, Y.; Koichi, K. Patent Number, WO01 55074, 

10 Publication date: 2001-08-02.) 

General Method for Enzymatic Preparation of a-hydroxy acids: 

[000369] (*)-(-)-Mandelic Acid To a solution of mandelonitrile (1 .005 g, 7.56 mmol) in 
150 mL of sodium phosphate (100 mM) buffer at pH 8 with 10% v/v methanol, that had been 
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N 2 (g) sparged, at 37 °C, was added 9 mg of nitrilase 1 (normalized for nitrilase content). The 
reaction was conducted under N 2 (g) atmosphere on a rotating platform shaker. Reaction 
progress was monitored by withdrawing aliquots for HPLC analysis. After 3 h incubation, 
the reaction mixture was acidified to pH 2 with 1 N HC1 and extracted with diethyl ether (4 x 
50 ml). Organic fractions were concentrated in vacuo and then the residue was taken up in 
10% sodium bicarbonate solution. This aqueous solutions was then washed with diethyl ether 
(3 x 50 ml) and then acidified to pH 2 with 1 N HC1 and extracted with diethyl ether (3 x 50 
ml). Organic fractions were combined, washed with brine, dried over MgS0 4 , filtered and 
then concentrated in vacuo. (£)-(-)-Mandelic acid (933 mg, 6.22 mmol) was isolated as a 
colorless powder in 86 % yield. 'H NMR (DMSO-dg, 500 MHz) 5 12.6 (br, s, 1H) 7.41 (m, 
2H), 7.34 (m, 2H), 7.28 (m, 1H), 5.015 (s, 1H). ,3 C NMR DMSO-dfe 125 MHz) 5 
174.083, 140.216, 128.113, 127.628, 126.628, 72.359. MS calc'd for [QHgOj] 150.07, found 
150.9 (ESI +); ee = 98 % [HPLC]. [a] 20 S98 = -134.6 (c = 0.5, methanol). 
[000370] (-)-2-chloromandelic acid 'H NMR (DMSO-de, 500 MHz) 8 7.75 ( m, 1H), 7.44 
(m, 1H), 7.34 (m, 2H), 5.34 (s, 1H). I3 C NMR (DMSO, 298K, 125MHz) 8 173.070, 
137.985, 132.105, 129.399, 129.158, 128.705, 127.235. MS calc'd for [QH7CIO3] 186.0, 
found 185.0 (LC-MS-). ee = 96% [HPLC]. 92 % yield. [a] 20 598 = -137.6 (c= 0.5, ethanol). 
[000371J (-)-2-bromomandelic acid 'H NMR (DMSO-d,;, 500 MHz) 8 7.60 (d, J= 7.93, 
1H), 7.48 (m, 1H), 7.40 (m, 1H), 7.25 (m, 1H), 5.30 (s, 1H). 13 C NMR DMSO-d 6 , 125 
MHz) 8 172.994, 139.61, 132.355, 129.652, 128.753, 127.752, 122.681, 71.644. MS calc'd 
for [C 8 H 7 Br0 3 ] 230.0, found 230.9. ee = 96% [HPLC]. 92% yield. [a] 20 59 8= -1 16.4 (c= 0.5, 
ethanol). 

[000372] (-)-2-methylmandelic acid 'H NMR (DMSO-ds, 500 MHz) 8 1 1 .78 (bs, 1H) 7.38 
(m, 1H), 7.16-7.38 (m, 3H), 5.18 (s, 1H), 2.35 (s, 3H). 13 C NMR DMSO-ds, 125 MHz) 8 
174.229, 138.623, 135.649, 130.129, 127.491, 126.990, 125.698, 125.698, 69.733, 18.899. 
MS calc'd for [C9H10O3] 166.1, found 165.2. ee = 91 % [HPLC]. 86 % yield. [a] 20 598 = - 
164.4 (c = 0.5, ethanol). 

[000373] (-)-3-chloromandelic acid 'H NMR (DMSO-dg, 500 MHz) 8 7.46 (s, 1H), 7.36 (m, 
3H), 5.07 (s, 1H). ,3 C NMR (DMSO, 298K, 125MHz) 8 173.554, 142.685, 132.813, 130.069, 
127.568, 126.355, 125.289,71.659. MS calc'd for [QH7CIO3] 186.0, found 185.34 (MALDI 
TOF -). ee = 98 % [HPLC]. 70 % yield. [a] 20 S9S = -120.4 8 (c= 0.5, methanol). 
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[000374] (-)-3-bromomandelic acid J H NMR (DMSO-de, 500 MHz) 5 7.60 (s, 1H ), 7.49 
(m, 1H), 7.42 (m, 1H), 7.31 (m, 1H), 5.06 (s, 1H). 13 C NMR (DMSO, 298K, 125MHz) 8 
173.551, 142.917, 130.468, 130.379, 129.237, 125.687, 121.404, 71.605. MS calc'd for 
[C8H 7 Br0 3 ] 229.98, found 229.1 (LC-MS). ee = 98 % [HPLC]. 82 % yield. [ct] 20 598 = -84.8 
(c= 0.5, ethanoi). 

[000375] (-)-4-fluoromandelic acid *H NMR (DMSO, 298K, 500MHz) 8 12.65 (s, 1H), 
7.44 (m, 2H), 7.17 (in, 2H), 5.91 (s, 1H), 5.03 (s, 1H) 13 C NMR (DMSO, 298K, 125MHz) 5 
173.93, 162.57, 136.47, 128.61, 128.55, 114.96, 114.80, 71.61. MS calc'd for [CgH 7 F0 3 ] 
170.0, found 168.8. ee = 99% [HPLC]. *1% yield. [a] 20 5 9 8 = -152.8 (c= 0.5, methanol). 
[000376] (-)-l-naphthylglycolicacid 'HNMR(DMSO-d 6 ,500MHz)8 8.28-8.26 (m, 
1H), 7.87-7.93 (m, 2H), 7.47-7.58 (m, 4H), 5.66 (s,lH). ,3 C NMR DMSO-de, 125 MHz) 
8 174.288, 136.284, 133.423, 130.654, 128.353, 128.192, 125.926, 125.694, 125.613, 
125.266, 124.558, 70.940. MS calc'd for [C, 2 H,o0 3 ]: 202.21 found 201.37 (MALDI TOF -). 
ee = 95% [HPLC]. 90 % yield [a] 20 598 = -1 1 5.4 (c = 0.5, ethanol). 
[000377] (-)-2-naphthylglycolic acid 'H NMR (DMSO-d 6 , 500 MHz) 8 12.6 (bm, 1H), . 
7.88-7.93 (m,4H), 7.48-7.56 (m,3H), 5.20 (s,lH). 13 C NMR DMSO-de, 125 MHz) 8 
174.005,137.760,132.644,132.498, 127.811, 127.658, 127.506, 127.209, 125.993, 125.334, 
124.761, 72.472. MS calc'd for [Ci 2 H 10 O 3 ] 202.21, found 201.37 (MALDI TOF). ee = 98% 
[HPLC]. 68% yield. [afV =-11 5.4 (c = 0.5, ethanol). 

[000378] (-)-3-pyridylglycolic acid This Reaction was performed in 100 mM ammonium 
formate buffer at pH 8. To isolate the product, the reaction mixture was filtered through a 
10,000 MWCO membrane to remove enzyme and then concentrated in vacuo. J H NMR 
(DMSO-d6, 500 MHz) 8 8.56 (s, 1H), 8.36 (d, /= 4.57 Hz, 1H), 8.25 (s, 1H), 7.71 (m, 1H), 
7.25 (dd, /= 4.98, 4.80 Hz 1H), 5.45 (s, 1H). ,3 C NMR DMSO-de, 125 MHz) 8 165.911, 
147.862, 147.251, 139.118, 133.381, 122.746, 71.508. MS calc'd for [C 7 H 7 N0 3 ] 153.04, 
found 154.0 ((MALDI TOF). ee = 92% [HPLC], 84% yield, [oc] 20 598 = -65.2 (c = 0.5, H 2 0). 
[000379] (->3-thienylglycolic acid ] H NMR (DMSO-de, 500 MHz) 8 7.48 (m, 1H), 7.45 
(d, J= 2.81, 1H,), 7.10 (m, 1H), 5.09 (s, 1H), 3.33 (s, 1H) ,3 C NMR (DMSO, 298K, 
125MHz) 8 173.704, 141.109, 126.446, 126.042, 122.247, 68.915 MS calc'd for [CeHe0 3 S] 
158.00, found 157.224 (MALDI TOF). ee = 95 % [HPLC]. 70 % yield. [a] 20 598 = -123.2 8 
(c= 0.5, methanol). 
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[000380] (5)-(-)-phenyllactic acid 'H NMR (DMSO-de, 500 MHz) 8 7.28(m, 5H), 
4.17(dd, J= 4.5 Hz, J= 8.3 Hz, 1H), 2.98(dd, J= 4.5 Hz, J= 13.7 Hz, 1H), 2.79 (dd, J= 8.3 
Hz, J= 13.7 Hz, 1H). ,3 C NMR (DMSO, 298K, 125MHz) 8 178.16, 133.4, 129.27, 128.6, 
127.3, 70.45, 44.12. ee = 97 % [HPLC], 84 % yield. [a] 20 598 = -17.8 (<*= 0.5, methanol). 
[000381] (-)-2-methylphenyllactic acid 'H MMR (DMSO-de, 500 MHz) 5 7. 16 (m, 4H), 
4.47 (dd, J= 3.9 Hz, J= 8.8 Hz, 1H), 3.25(dd, /= 3.9Hz, 14.3 Hz, 1H), 2.94 (dd, J= 8.8 Hz, 
J= 14.3Hz), 2.35(s, 3H). 13 C NMR (DMSO, 298K, 125MHz) 8 178.61, 137.08, 134.74, 
130.80, 130.25, 127.44, 126.34, 70.93, 37.67, 19.79. MS calc'd [CoHnOj] 180.08, found 
180.0 (ESI +). 86 % yield, ee = 95 % [HPLC]. [a] 20 598 = -13.2 (c= 0.5, methanol). 
[000382] (-)-2-bromophenyllactic acid *H NMR (DMSO-de, 500 MHz) 8 728 (m, 4H), 
4.60(dd, J= 4.0 Hz, 7= 9.1 Hz, 1H), 3.45(dd, J = 4.0 Hz, /= 14.1 Hz, 1H), 3.04(dd, J= 8.0 
Hz,J= 14.1 Hz, 1H). 13 C NMR (DMSO, 298K, 125MHz) 8 178.70, 136.05, 133.21, 132.10, 
128.99, 127.72, 125.0, 70.04, 40.76. MS calc'd for [C9H9BTO3] 243.9, found 243.3 (ESI +). 
91 % yield, ee = 93 % [HPLC], [a] 20 598 = -17.6 (c= 0.5, methanol) 

[000383] (-)-2-fluorophenyllactic acid 'H NMR (DMSO-d 6 , 500 MHz) 8 7. 1 0 (m, 4H), 4.64 
(t, J= 6.8 Hz, 1H), 3.1 l(d, /= 6.8 Hz, 2H). ,3 C NMR (DMSO, 298K, 125MHz) 8 132.1 8, 
131.52, 129.66, 129.03, 128.07, 124.05, 115.8, 63.02, 44.79. MS calc'd for [C 9 H 8 FNO]: 
165.06, found 164.2 (ESI+). 91% yield ee = 88 % [HPLC]. [a] 20 598 = -14.0 (c= 0.5, 
methanol). 

[000384] (-)-3-methylphenyllactic acid ! H NMR (DMSO-d 6 , 500 MHz) 8 7. 1 8 (m, 1H), 
7.02 (m, 3H), 4.54 (dd, J = 4.6 Hz, J= 8.0 Hz, 1H), 3.06(dd, /= 4.54 Hz, J= 14.4 Hz, 1H), 
2.83(dd, J= 8.0 Hz, 7= 14.4 Hz, 1H), 2.36 (s, 3H). ,3 C NMR (DMSO, 298K, 125MHz) 8 
175.88, 163.80, 130.33, 130.09, 125.7, 116.68, 113.75, 71.31, 34.28. MS calc'd for 
[C10H1 ,NO] 161.08, found 162.2 (ESI +). 80 % yield, ee = 98 % [HPLC]. [a] 20 59g = -2.4 (c= 
0.5, methanol). 

[000385] (->3-fluorophenyllactic acid ! H NMR (DMSO-de, 500 MHz) 8 7.2 (m, 1H), 6.9 
(m, 3H), 4.56 (dd, 4.5 Hz, J= 7.9 Hz, 1H), 3.09(dd, J= 4.5 Hz, J= 14.1 Hz, 1H), 2.86 (dd, J 
= 7.9 Hz, J= 14.1 Hz, 1H). ,3 C NMR (DMSO, 298K, 125MHz) 8 175.88, 163.80, 130.33, 
130.09, 125.7, 116.68, 113.75, 71.31, 34.28. MS calc'd for [Q^AF] 184.05, found 184.1 
(ESI +). 82 % yield, ee = 97 % [HPLC]. [a] 20 598 = -5.2 (c= 0.5, methanol). 
[000386] (-)-l-napthyllactic acid ! H NMR (DMSO-de, 500 MHz) 8 8.57 (m, 1H), 8.21(m, 
1H), 8.08 (m, 1H), 7.61 (m, 4H), 4.64 (dd, 3.5 Hz, 8.5 Hz, 1H), 3.84 (dd, 7= 3.5 Hz, J = 14.5 
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Hz, 1H), 3.38 (dd, J= 8.5 Hz, J= 14.5 Hz, 1H) ,3 C NMR (DMSO, 298K, 125MHz) 8 177.7, 
140.31, 129.74, 129.24, 128.92, 128.26, 127.84, 125.63, 124.53, 124.05, 123.42, 70.58, 38.0. 
MS calc'd for [C 13 H„NO] 197.08, found 197.1(ESI +). 87 % yield, ee = 94 % [HPLC]. 
[ot] 20 598 = -16.2 (c= 0.5, methanol). 

5 [000387] (-)-2-pyridyllactic acid 'H NMR (DMSO-dfi, 500 MHz) 8 8.49 (m,lH), 7.62 (m, 
1H), 7.21 (m, 2H), 4.50 (t, J= 5.0 Hz, 1H), 3.01 (d, 7= 5.0 Hz, 2H). 13 C NMR (DMSO, 
298K, 125 MHz) 8 178.8, 159.79, 148.84, 136.89, 124.35, 121.75, 71.14, 44.09. MS calc'd 
for [C8H 9 N03]: 167.06, found 167.0. (ESI +). 62 % yield, ee = 94 % [HPLC], [a] 20 598 = - 
3.6 (c= 0.5, methanol). 

10 [000388] (-)-3-pyridyllactic acid 'HNMR^MSO-ds, 500 MHz) 8 8.43(m, 2H), 7.62(m, 
1H), 7.28(m, 1H), 4.57(t, 5.37Hz, 1H), 2.85(<L 5.37Hz, 2H). ,3 C NMR (DMSO, 298K, 125 
MHz) 8 176.6, 150.03, 147.12, 136.41, 129.45, 123.26, 61.56, 31.46 MS calc'd for 
[CgH 9 N0 3 ] 167.06, found 167.0 (ESI +). 59 % yield, ee = 94 % [HPLC]. [a] 20 598 = -4.0 (c 
= 0.5, methanol). 

15 [000389] (-)-2-thienyllactic acid ! H NMR (DMSO-d 6 , 500 MHz) 8 7.18(m, 1H), 6.94(m, 
lH),6.90(m, 1H), 4.49 (dd,J= 4.1 Hz,/=6.25Hz, 1H), 3.36 (dd, .7=4.1 Hz,7= 15.0Hz, 
1H), 3.26(dd, /= 6.25 Hz, J= 15.0 Hz, 1H). 13 C NMR (DMSO, 298K, 125MHz) 8 127.68, 
127.41, 125.58, 124.60, 118.70, 63.25, 44.84. MS calc'd for [C 7 H 7 NOS] 153.02, found 153.0 
(ESI +). 85 % yield, ee = 95 % [HPLC]. [<x] 20 598 = -13.0 (c= 0.5, methanol). 

20 [000390] (-)-3-thienyllactic acid J H NMR (DMSO-d 6 , 500 MHz) 8 7.30(m, 1H), 7.13(m, 
1H), 7.01(m, 1H), 4.50 (dd, /= 4.25 Hz, J= 6.5 Hz, 1H), 3.21(dd, 7= 4.25 Hz, J= 15.0 Hz, 
1H), 3.10 (dd, J= 6.5 Hz, 7= 15.0 Hz, 1H). 13 C NMR (DMSO, 298K, 125MHz) 8 127.50, 
136.09, 128.83, 126.24, 123.32, 70.65, 34.84. MS calc'd for [C 7 H 8 0 3 S] 172.02, found 172.1 
(ESI +). 81 % yield, ee = 96 % [HPLC]. [cc] 20 598 = -18.8 (c= 0.5, methanol). 
25 Enzymatic Hydrolysis of 3-Hvdroxvglutarvlnitrile: 

[000391] 3-Hydroxyglutarylnitrile (1.0 g, 9.0 mmol, 240 mM) was suspended in N 2 (g) 
sparged sodium phosphate buffer (37.5 mL, pH 7, 100 mM) at room temperature. Cell lysate 
(30 mg, normalized for nitrilase content) was added to bring the concentration to 0.8 mg/ml 
enzyme and the reaction was at shaken at 100 rpm, room temperature. Reaction progress was 
30 monitored by TLC (1 : 1 EtOAcrHexanes, R,=0.32, nitrile; Rf=0.0, acid) After 22 h, the 

reaction was acidified with 1M HC1. The reaction mixture was continuously extracted with 
diethyl ether. The acid product was isolated as a yellow oil (1.15 g, 98 % yield). ] H NMR 
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(DMSO, 298K, 500MHz) 8 12.32 (s, 1H), 5.52 (s, 1H), 4.10 (m, 1H), 2.70 (dd, 1H, J = 16.8, 
4.1 Hz), 2.61 (dd, 1H, J = 16.9, 6.3 Hz), 2.44 (dd, 1H, J = 15.4, 5.3 Hz), 2.37 (dd, 1H, J = 
15.6, 7.8 Hz). ,3 C NMR (DMSO, 298K, 125 MHz) 5 171.9, 1 18.7, 63.4, 41.2, 25.2 MS 
calc'd for [C5H7NO3]: 129.0, found 130.0 [M+H*], (ESI +). 

5 Preparation of (RU-1-Meth yl (3-0-rbenzo vll-4-cv aT1 nVhiit a iin a t P . 

[000392] Benzoyl chloride (0.068 ml, 0.752 mmol) was added to a stirred solution of (R)- 
methyl-(3-hydroxy-4-cyano)-butanoate (71.7 mg, O.SOlmmol) in pyridine (2.0 ml), at room 
temperature. After 19 hours, add an additional 0.5 equivalent of benzoyl chloride (0.023ml, 
0.251mmol). Reaction was complete at 23 h, as determined by TLC. Add 1ml H 2 0, extract 

10 with ether (3 x 10ml). Wash with brine (2 x 10ml). Dry combined aqueous extracts with 
MgS0 4 . Filter off drying agent and remove solvent by rotary evaporation. Purify by column 
chromatography (hexane:ethyl acetate [2:1]. Rotary evaporation of fractions yielded the 
product as a yellow oil (46 mg, 0.186 mmol, 37%). *H NMR (DMSO, 298K, 500MHz) 6 
7.96 (d, 2H, J = 7.8), 7.70 (t, 1H, J = 7.25), 7.56 (t, 2H, J = 7.8), 5.55 (m, 1H), 3.59 (s, 3H), 

15 3.13 (m, 2H), 2.90 (m, 2H). 13 C NMR (DMSO, 298K, 125MHz) 8 169.6, 164.5, 133.8, 129.3, 
128.9, 128.5, 117.3, 66.0, 51.8, 37.5, 22.2 MS calc'd for [C 13 H 13 N04]: 247.25, found 270.3 
[M+Na + ] ee = 95% [HPLC]. [a] 20 598 -32.4 (c = 0.5, CHCI3). 

Synthesis of raiRth yl-te-hvdroYY^vanolhutannate 

[000393] A 0.2 M solution of (i?>3-hydroxy-4-cyano-butanoic acid (50 mg, 0.387 mmol) in 
anhydrous ethanol (1 .94 mL) was prepared. The ethanol solution was added dropwise to 1 .0 
ml of a 50:50 (v/v) mixture of anhydrous 1 M HC1 ethereal solution and anhydrous ethanol 
over sieves. The reaction was stirred overnight at room temperature under N 2 (g) atmosphere. 
The reaction was monitored by TLC, (1 :1 EtOAc:Hexanes, R f = 0.45, ester; R f = 0.0, acid, 
stained with p-anisaldehyde). After 30 hrs, solvent was removed by rotary evaporation. The 
25 crude product was taken up in 25 mL ether, washed with 5 mL saturated bicarbonate and then 
5 mL brine. The organic extract was dried over MgSO* filtered and then concentrated in 
vacuo, yielding the product as a clear oiL 'H NMR PMSO, 298K, 500MHz) 8 5.60 (d, 1H, 
7= 5.58 Hz), 4.12 (m, 1H), 4.07 (q, 2H, J= 7.1), 2.66 (m, 2H), 2.47 (m, 2H), 1.87 (t, 3H, J= 
7.0). ,3 C NMR (DMSO, 298K, 125 MHz) 5 17021, 1 18.60, 63.40, 59.98, 41.10, 25.14, 
30 14.02. MS calc'd for [C 7 H„N0 3 ]: 157.1, found 158.2. [M+H+] 
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Example 13: Optimization Of Nitrilases For The Enantioselective Production Of 
fig)-2-ChloromandeIic Acid 



[000394] Chloromandelic acid has the structure: 



HQ H 




COOH 



[000395] Nitrilases were identified which selectively produced (R>2-chloromandelic 
acid from (2?,iS>2-chloromandelonitrile. Nitrilases were identified which were useful to 
improve the enantioselectivity of the enzymes and establishing the effects of process 
conditions on the enzymes. An examination of the reaction conditions for the enzymatic 
nitrile hydrolysis was carried out in order to improve the enantiomeric excess of the product. 
Additionally, further investigation into the effects of process conditions on the enzyme was 
performed. 



OH 



HQ H 



CN 




COOH 



2-chloromandelonitrile (/?)-2-chloromandelic acid 



[000396] In this aspect, the enantioselective production of (K>2-chloromandelic acid 
was the target. One enzyme, SEQ ID NOS:385, 386, was selected for further confirmation of 
its enantioselectivity on 2-chloromandelonitrile. SEQ ID NOS:385, 386 was shown to be 
stable to process components, with a half-life of 8 hours. The enzyme was inhibited by 2- 
chlorobenzaldehyde and a contaminant in the cyanohydrin substrate, 2-chlorobenzoic acid. 
The enzymatic reaction was scaled up to a substrate concentration of 45 mM 2- 
chloromandelonitrile. Over 90% conversion was obtained, wither of 97%. The chiral HPLC 
method was improved, to remove a contaminating peak that was present in the substrate. 
Improved accuracy in the determination of enantioselectivity was obtained using this method. 
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[000397] Nitrilases were screened against 2-chloromandelonitrile, with 31 nitrilases 
exhibiting activity on this substrate. High enantioselectivities were shown by 9 enzymes. 
The optimization of 5 of these enzymes was undertaken and one of them was identified as a 
candidate for the next stage of development. 
5 [000398] In an effort to improve the enantioselectivity of the selected enzymes for (R)- 
2-chloromandelic acid, a number of factors that are known to affect this property, together 
with the activity of the enzymes, were investigated. These included pH, temperature, buffer 
strength and addition of solvents to the reaction. Initially, 5 nitrilases were selected for these * 
studies, based on the high enantioselectivities obtained by these enzymes. These enzymes 
10 were: SEQ ID NOS:385, 386, SEQ ID NOS:197, 198, SEQ ID NOS:217, 218, SEQ ID 
NOS:55, 56, and SEQ ID NOS:167, 168. 
Effect of pH 

[000399] The enzymatic reactions were run at a range of pH values, from pH 5 to pH 9. 
An increase in both activity and enantioselectivity with increasing pH was observed for all of 
15 the enzymes. With the exception of SEQ ID NOS:385, 386, pH 9 (0.1 M Tris-HCl buffer) 
was determined as the optimum for activity and enantioselectivity. The optimum pH for SEQ 
ID NOS:385, 386 was pH 8 (0. 1 M sodium phosphate buffer). 
Effect of temperature 

[000400] The enzymes exhibited similar temperature profiles, with the highest activities 
20 being measured at 37°C and 45°C. Although the latter temperature resulted in higher 

conversions, the enantioselectivity of most of the enzymes showed a clear preference for the 
lower temperatures, with ee values being 10-20% lower when the temperature was raised 
above 37°C In the case of SEQ ID NOS:385, 386 a slight optimum for enantioselectivity 
was evident at 37°C. Therefore, this temperature was established as the optimum for 
25 hydrolysis of 2-chloromandelonitrile by these enzymes. 
Effect of enzyme concentration 

[000401] During the concurrent investigation into the enantioselective hydrolysis of 
phenylacetaldehyde cyanohydrin to L-phenyllactic acid, the concentration of the enzyme in 
the reaction was found to have a significant effect on the enantioselectivity of the reaction. 
30 This provided an indication that the enzymatic hydrolysis rate was faster than the rate of 
racemization of the remaining cyanohydrin in the reaction. On this basis, the effect of 
enzyme concentration on the enantioselectivity of the enzymes towards (R)-2- 
chloromandelonitrile was investigated. Enzymatic reactions were performed with the 
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standard concentration of enzyme (0.6 mg protein/ml), half the standard concentration and 
one-tenth of the standard concentration. 

[000402] The following Table indicates the highest conversions achieved for the 
reactions, with the corresponding ee. With the exception of SEQ ID NOS:385, 386, it 
appears that very little, if any, increased enantioselectivity is observed. Therefore, it appears 
that the rate of racemization of the remaining chloromandelonitrile is not a limiting factor to 
obtaining higher enantioselectivities. 
Investigation of other positive enzymes 

[000403] In addition to the enzymes in the above Table, a number of other nitrilases 
were screened for their enantioselectivities on 2-chloromandelonitrile. Some of these 
enzymes were newly discovered enzymes. Some were reinvestigated under conditions that 
have since been found to be optimal for these enzymes (pH 8 and 37°C). The results of this 
screening are shown below in the Table. 

Effect of co-solvent concentration 

[000404] The addition of methanol as a cosolvent in the enzymatic reactions was shown 
to enhance the ee. In order to establish the lowest level of methanol that could be added to 
the reactions, the enzyme reactions were performed at varying concentrations of methanol, 
ranging from 0-20% (v/v). No significant differences in enantioselectivity were evident 
between the various methanol concentrations. However, the ee in these reactions was 97- 
98%, while that of the control reaction, with no added methanol was 95-96%. While this 
difference in ee is small, the effect of the methanol was shown in more than one set of 
experiments during the course of this investigation and is therefore regarded as significant. 

Effect of reaction components on activity of SEP ID NOS:385. 386 
[000405] A vital part of an investigation into process optimization of an enzyme 
involves the determination of the effects of any compounds which could be present in the 
enzymatic reaction. For SEQ ID NOS:385, 386, these components were established as the 
starting material and equilibrium product of the cyanohydrin, 2-chlorobenzaldehyde; the 
product, 2-chloromandelic acid and the contaminant detected in the substrate, 2- 
chlorobenzoic acid. The addition of cyanide to the reaction was found to have no effect on 
the enzyme activity. The presence of trace amounts of triethylamine was also found to be 
tolerable to the enzyme. 
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[000406] The effect of the various reaction components on the activity of SEQ ED 
NOS.385, 386 was assessed by addition of various levels of possible inhibitors to the enzyme 
reaction. From these experiments, it appeared that both the aldehyde and its oxidation 
product, 2-chlorobenzoic acid were detrimental to enzyme activity. Approximately 70% and 
5 40% of the activity of SEQ ID NOS:385, 386 was lost upon addition of 5 mM 2- 
chlorobenzaldehyde or 5 mM 2-chlorobenzoic acid to the reaction, respectively. 
Scale-up hydrolysis of 2-chloromandelonitrile 

[000407] In order to confirm the conversion and enantioselectivity obtained by SEQ ID 
NOS:385, 386 for the production of (R>2-chloromandeUc acid, a larger scale reaction was 
10 performed and the product isolated from the aqueous mixture. The reaction was performed in 
a 20 ml reaction volume, with a substrate concentration of 45 mM 2-chloromandelonitrile. 
Complete conversion of the cyanohydrin was obtained, with 30 mM product formed. The ee 
of the product was 97% and the specific activity of the enzyme was 0.13 mmol product/mg 
nitrilase/h. 

1 5 [000408] It is evident from this experiment, together with the other experiments 

performed, that the formation of product does not account for the complete loss of substrate. 
In all experiments, a nitrile-containing control sample was run, in order to determine the 
extent of breakdown of the cyanohydrin. Overall, it appears that approximately 50% of the 
substrate is lost over aperiod of 4 hours at 37°C. It is expected that this breakdown would be 

20 to its equilibrium products, cyanide and 2-chlorobenzaldehyde, which could undergo further 
oxidation. A larger scale reaction was also run at a substrate concentration of 90 mM 2- 
chloromandelonilrile. However, no product was detected in this reaction. At higher substrate 
concentrations, it is expected that the concentration of the equilibrium product, 2- 
chlorobenzaldehyde and the contaminant, 2-chlorobenzoic acid will be present in higher 

25 amounts. Based on the results above, it is possible that the enzyme will be completely 
inhibited under such conditions. 
Reactions under biphasic conditions 

[000409] The use of biphasic systems can facilitate product recovery following the 
enzymatic reaction step. These systems can be also be used for the removal of products or 
30 by-products which are inhibitory to the enzyme. The nitrilases were shown to be active under 
biphasic conditions using a variety of solvents. Following the low conversions obtained at 
the higher substrate concentration above, further investigation of a biphasic system was 
performed with the hit enzyme, SEQ ID NOS:385, 386. It was important to ascertain 
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whether any inhibitory factors could be removed by the solvent phase and whether any 
process advantages could be gained by the use of a biphasic system. 
[00041 0J Promising results were obtained with hexane as the organic phase. Therefore, 
further investigations involved the use of this solvent at two different levels: 100% and 70% 
of the volume of the aqueous phase, with increasing substrate concentrations, up to 90 mM. 
The substrate was dissolved in the organic phase. The level of hexane did not appear to 
affect the level of product formation, particularly at the higher concentrations of 2- 
chloromandelonitrile. 

[00041 1] Once again, high conversion was observed in a biphasic system, with a 76% 
yield of product being observed after 5 hours. The rate of product formation appeared to be 
slightly lower than in the corresponding monophasic system, where the reaction is complete 
within 1 hour. Lower enantioselectivity was observed in the biphasic system. Some 
possibilities which may account for these results are (i) the mass transfer rate is lower than 
the rate of enzyme activity or (ii) the non-polar solvent directly affects the enzyme. 
[000412] At a higher substrate concentration, a very low conversion was observed, with 
7 mM 2-chloromandelic acid being formed from 90 mM 2-chloromandelonitrile. This level 
of conversion, albeit low, was higher than that observed in the monophasic system with the 
same substrate concentration. These results suggest that some of the inhibitory 2- 
chlorobenzaldehyde or 2-chlorobenzoic acid is retained in the non-polar organic solvent. 
Standard assay conditions: 
[000413] The following solutions were prepared: 

- Substrate stock solution: 50 mM of the cyanohydrin substrate in 0.1 M phosphate buffer 
(pH8). 

- Enzyme stock solution: 3.33 ml of 0.1 M phosphate buffer (pH 8) to each vial of 20 mg of 
lyophilized cell lysate (final concentration 6 mg protein/ml) 

[000414] The reaction volumes varied between the different experiments, depending on the 
number of time points taken. Unless otherwise noted, all reactions consisted of 25 mM 2- 
chloromandelonitrile and 10% (v/v) of the enzyme stock solution (final concentration 0.6 mg 
protein/ml). The reactions were run at 37°C, unless otherwise stated. Controls to monitor the 
nitrile degradation were run with every experiment. These consisted of 25 mM 2- 
chloromandelonitrile in 0.1 M phosphate buffer (pH 8). 

[000415] Sampling of reactions: The reactions were sampled by removing an aliquot from 
each reaction and diluting these samples by a factor of 8. Duplicate samples were taken for 
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analysis by chiral and achiral HPLC methods. The reactions were sampled at 0.5, 1, 1.5, 2, 3, 
and 4 hours, unless otherwise shown in the figures above. 
HPLC methods 

The achiral HPLC method was run on a SYNERGI-RP™ column (4 \xm; 50x2 mm) 
5 with a mobile phase of 10 mM Na phosphate buffer (pH 2.5). A gradient of methanol was 
introduced at 3.5 min and increased to 50% over 1.5 min, following which the methanol was 
decreased to 0%. Elution times for 2-chloromandelic acid and 2-chloromandelonitrile were 
2.5 and 6.1 minutes, with another peak appearing with the nitrile at 5.9 minutes. 
[000416] As described above, the chiral HPLC method was optimized during the course 

10 of the investigation, to improve the separation between 2-chlorobenzoic acid and (S)-2- 
chloromandelic acid. The optimized method was used during the latter half of the 
investigation and was run on a CfflROBIOTIC-R™ column. The mobile phase was 80% 
Acetonitrile:20% of 0.5% (v/v) acetic acid. Elution times for (S)-2-chloromandelic acid and 
(R)-2-chloromandelic acid were 2.4 and 3.5 minutes respectively. A peak for 2- 

15 cWorobenzoic acid eluted at 1 .9 minutes. For each experiment, a standard curve of the 
product was included in the HPLC run. The concentration of product in the samples was 
calculated from the slope of these curves. 
Effect of pH 

[000417] The effect of pH on the enzyme activity and enantioselectivity was studied by 
20 performance of the standard assay in a range of different buffers: 0. 1 M Citrate Phosphate pH 
5; 0.1 M Citrate Phosphate pH 6; 0.1 M Sodium Phosphate pH 6; 0.1 M Sodium Phosphate 
pH 7; 0.1 M Sodium Phosphate pH 8; 0.1 M Tris-HCl pH 8; and 0.1 M Tris-HCl pH 9. The 
standard enzyme concentration was used for all enzymes, with the exception of SEQ ID 
NOS:385, 386, where half the standard concentration was used (5% v/v of the enzyme stock 
25 solution). 

Effect of temperature 

[000418] The effect of temperature on the activity and enantioselectivity was 
investigated by performing the standard assay at a range of different temperatures: room 
temperature, 37°C, 45°C, 50°C and 60°C. The standard enzyme concentration was used for 
30 all enzymes, with the exception of SEQ ID NOS:385, 386, where half the standard 
concentration was used (5% v/v of the enzyme stock solution). 
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Effect of enzyme concentration 

[000419] Reactions were run under standard conditions, with varying enzyme 
concentrations: 1%, 5% and 10% (v/v) of the enzyme stock solution. The reaction volume 
was normalized with the appropriate buffer. 
5 Addition of solvents 

[000420] The enzyme reactions were performed in the presence of methanol as a 
cosolvent. Methanol was added to the standard reaction mixture at the following levels: 0, 5, 
10, 15 and 20% (v/v). 

[000421] Biphasic reactions with hexane were also investigated. The aqueous phase 

1 0 contained 1 0% (v/v) of the enzyme stock solution in 0. 1 M phosphate buffer (pH 8). The 
cyanohydrin was dissolved in the hexane, prior to addition to the reaction. Two levels of 
organic phase were used: 1 equivalent and 0.7 equivalents of the aqueous phase volume. In 
addition, a range of nitrile concentrations was investigated: 25, 45 and 90 mM. These 
reactions were run at room temperature. 

1 5 [000422] Samples from these reactions were taken both from the aqueous and the 

solvent phase. The hexane was evaporated by centrifugation under vacuum and redissolved 
in a 50:50 mixture of methanol and water, so that the samples were at the same dilution as the 
aqueous samples. Analysis of the samples was performed by non-chiral and chiral HPLC. 
Effect of process components 

20 [000423] (i) Activity: The effect of the process components on the activity of the 

enzymes was established by addition of the individual components, 2-chlorobenzaldehyde, 2- 
chlorobenzoic acid or 2-chloromandelic acid, to the enzymatic reaction. The enzymatic 
reactions were carried out under standard conditions, in the presence of one of the 2 possible 
inhibitors as follows: 5, 10, 20 and 25 mM 2-chlorobenzaldehyde; 1.5 and 5 mM 2- 

25 chlorobenzoic acid; and 10, 20, 40 and 80 mM 2-chloromandelic acid. Control reactions 
were performed under standard conditions, with no additive. At each of the sampling times, 
the samples were diluted to a level of 1 in 10. Control samples containing the reaction 
components without enzyme were used and diluted to the same level. The samples were 
analysed by non-chiral HPLC. 

30 [000424] (ii) Stability: The stability of the enzymes to process conditions was monitored 
by incubation of the enzymes in the presence of the reaction components, 2- 
chlorobenzaldehyde and 2-chloromandelic acid for predetermined time periods, prior to assay 
of the enzyme activity under standard conditions. In these experiments, the enzymes were 
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incubated at a concentration of 3 mg protein/ml in the presence of each of the following 
reaction components: 5, 10, 20 and 25 mM 2-chlorobenzaldehyde; and 10, 20, 40 and 80 mM 
2-chloromandelic acid. Control reactions were performed by incubation of the enzyme in 
buffer only. 

[000425] Assay conditions : At 0, 4, 8 and 24 hours of incubation in the particular 
additive, 20 ^1 of the enzyme solution was removed and added to 60 ^1 of a 41.6 mM 
substrate stock solution and 20 p] buffer. The enzyme activity was thus assayed under 
standard conditions. The reactions were sampled 90 minutes after substrate addition and 
analyzed using the noh-chiral HPLC method. 
Scale-up of enzymatic reaction 

[000426] The enzymatic reactions were run at two difference concentrations: 45 mM 
and 90 mM substrate. The reactions were run under standard conditions, i.e. pH 8 (0.1 M 
sodium phosphate buffer), 37°C and 10% (v/v) of the enzyme stock solution. The substrate 
was dissolved in 10% (v/v/) methanol prior to addition of the buffer. The final reaction 
volume was 20 ml and the reactions were performed with magnetic stirring. 

Example 14: Optimization Of Nitrilases For The Enantioselective Production Of 
L-2-amino-6,6-dimethoxvhexanoic acid 



HCN, NH 4 + Me0 | H 2 N CN OMe 





Meo > "^'^^ CHO 0M * ~ H OMe, 

5,5-dimethoxypentanal 5,5-dimethoxypentanal L-2-amino-6,6- 

aminonitrile dimethoxyhexanoic 

acid 

[000427] Four of the isolated enzymes were shown to hydrolyze 2-amino-6-hydroxy 
hexanenitrile to (S)-2-amino-6-hydroxy hexanoic acid, with selectivity towards the L- 
enantiomer. A new target, with a similar structure to (S)-2-amino-6-hydroxy hexanoic acid 
was identified. A panel of the isolated nitrilases are screened against the target, 5,5- 
dimethoxypentanal aminonitrile. The positive enzymes are characterized on this substrate. 
Laboratory evolution techniques can be used to optimize these nitrilases for improved 
enantiospecificity towards the specified target. A primary screen is used to identify putative 
up-mutants, which is confirmed using HPLC. 

[000428] Optimization of enzymes: GSSM™ and GeneReassembly™ can be performed 
on selected nitrilases, in order to improve the enantioselectivity and activity of the enzymes 
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for the production of L-2-amino-6,6-dimethoxyhexanoic acid. Four enzymes were identified 
that can hydrolyze enantioselectively 2-amino-6-hydroxy hexanenitrile to L-(S)-2-amino-6- 
hydroxy hexanoic acid. However, a slight structural difference is present in the new target 
molecule, L-2-amino-6,6-dimethoxyhexanoic acid. In order to determine whether this 
5 difference affects the activity and enantioselectivity of the enzymes, the complete spectrum 
of nitrilases is screened against the new target. 

[000429] An enzyme exhibiting the highest combination of activity and 
enantioselectivity for the production of L-2-amino-6,6-dimethoxyhexanoic is selected for 
GSSM™. Following the mutation of the target enzyme, the resulting mutants will be 

10 screened on 5,5-dimethoxypentanal aminonitrile, using high throughput screening 

technology. Following confirmation of the up-mutants by HPLC analysis, the individual up- 
mutants will be combined in order to further enhance the properties of the mutant enzymes. 
[000430] In parallel to GSSM™, a GeneReassembly™ can be performed on a 
combination of parent enzymes, at least one of which can be selected for activity and 

1 5 enantioselectivity on L-2-amino-6,6-dimethoxyhexanoic acid. At least two other nitrilases, 
with a high degree of homology, can be reassembled with the former enzyme(s); these 
enzymes will be selected in order to provide diversity to the reassembled sequences. 
[000431] Crucial to the success of this evolution effort is the development of a high 
throughput assay for enantioselectivity. Such an assay is a novel enzyme-based 

20 enantioselectivity assay that allows for the screening of >30,000 mutants in a significantly 
shorter time period than the traditionally used method of HPLC. 

[000432] In one aspect, a non-stochastic method, termed synthetic ligation reassembly, 
that is related to stochastic shuffling, except that the nucleic acid building blocks are not 
shuffled or concatenated or chimerized randomly, but rather are assembled non- 
25 stochastically, can be used to create variants. This method does not require the presence of 
high homology between nucleic acids to be shuffled. The ligation reassembly method can be 
used to non-stochastically generate libraries (or sets) of progeny molecules having at least 
10 100 or at least 1 0 1000 different chimeras. The ligation reassembly method provides a non- 
stochastic method of producing a set of finalized chimeric nucleic acids that have an overall 
30 assembly order that is chosen by design, which method is comprised of the steps of 

generating by design a plurality of specific nucleic acid building blocks having serviceable 
mutally compatible ligatable ends, as assembling these nucleic acid building blocks, such that 
a designed overall assembly order is achieved. 
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[000433J The mutually compatible ligatable ends of the nucleic acid building blocks to 
be assembled are considered to be "serviceable" for this type of ordered assembly if they 
enable the building blocks to be coupled in predetermined orders. Thus, in one aspect, the 
overall assembly order in which the nucleic acid building blocks can be coupled is specified 
5 by the design of the ligatable ends and, if more than one assembly step is to be used, then the 
overall assembly order in which the nucleic acid building blocks can be coupled is also 
specified by the sequential order of the assembly step(s). In a one aspect of the invention, the 
annealed building pieces are treated with an enzyme, such as a ligase (e.g., T4 DNA ligase) to 
achieve covalent bonding of the building pieces. 

1 0 [000434] In a another aspect, the design of nucleic acid building blocks is obtained upon 
analysis of the sequences of a set of progenitor nucleic acid templates that serve as a basis for 
producing a progeny set of finalized chimeric nucleic acid molecules. These progenitor 
nucleic acid templates thus serve as a source of sequence information that aids in the design 
of the nucleic acid building blocks that are to be mutagenized, z.e. chimerized, recombined or 

15 shuffled. 

[000435] In one exemplification, the invention provides for the chimerization of a 
family of related genes and their encoded family of related products. In a particular 
exemplification, the encoded products are nitrilase enzymes. Nucleic acids encoding the 
nitrilases of the invention can be mutagenized in accordance with the methods described 
20 herein. 

[000436] Thus, according to one aspect of the invention, the sequences of a plurality of 
progenitor nucleic acid templates encoding nitrilases are aligned in order to select one or 
more demarcation points, which demarcation points can be located at an area of homology. 
The demarcation points can be used to delineate the boundaries of nucleic acid building 

25 blocks to be generated. Thus, the demarcation points identified and selected in the progenitor 
molecules serve as potential chimerization points in the assembly of the progeny molecules. 
[000437] Typically a serviceable demarcation point is an area of homology (comprised 
of at least one homologous nucleotide base) shared by at least two progenitor templates, but 
the demarcation point can be an area of homology that is shared by at least half of the 

30 progenitor templates, at least two thirds of the progenitor templates, at least three fourths of 
the progenitor templates, and preferably at almost all of the progenitor templates. Even more 
preferably still a serviceable demarcation point is an area of homology that is shared by all of 
the progenitor templates. 
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[000438] In a one aspect, the ligation reassembly process is performed exhaustively in 
order to generate an exhaustive library. In other words, all possible ordered combinations of 
the nucleic acid building blocks are represented in the set of finalized chimeric nucleic acid 
molecules. At the same time, the assembly order (i.e., the order of assembly of each building 
5 block in the 5 1 to 3 ! sequence of each finalized chimeric nucleic acid) in each combination is 
by design (or non-stochastic, non-random). Because of the non-stochastic nature of the 
method, the possibility of unwanted side products is greatly reduced. 
[000439] In another aspect, the method provides that, the ligation reassembly process is 
performed systematically, for example in order to generate a systematically 

1 0 compartmentalized library, with compartments that can be screened systematically, e.g. , one 
by one. Each compartment (or portion) holds chimeras or recombinants with known 
characteristics. In other words the invention provides that, through the selective and 
judicious use of specific nucleic acid building blocks, coupled with the selective and 
judicious use of sequentially stepped assembly reactions, an experimental design can be 

15 achieved where specific sets of progeny products are made in each of several reaction 
vessels. This allows a systematic examination and screening procedure to be performed. 
Thus, it allows a potentially very large number of progeny molecules to be examined 
systematically in smaller groups. 

[000440] Because of its ability to perform chimerizations in a maimer that is highly 
20 flexible, yet exhaustive and systematic, particularly when there is a low level of homology 
among the progenitor molecules, the invention described herein provides for the generation 
of a library (or set) comprised of a large number of progeny molecules. Because of the non- 
stochastic nature of the ligation reassembly method, the progeny molecules generated 
preferably comprise a library of finalized chimeric nucleic acid molecules having an overall 
25 assembly order that is chosen by design. In a particularly aspect, such a generated library is 
comprised of greater than 10 3 to greater than 10 1000 different progeny molecular species. 
[000441] In another exemplification, the synthetic nature of the step in which the 
building blocks are generated allows the design and introduction of nucleotides (e.g., one or 
more nucleotides, which may be, for example, codons or introns or regulatory sequences) that 
30 can later be optionally removed in an in vitro process (e.g., by mutageneis) or in an in vivo 
process (e.g., by utilizing the gene splicing ability of a host organism). It is appreciated that 
in many instances the introduction of these nucleotides may also be desirable for many other 
reasons in addition to the potential benefit of creating a serviceable demarcation point. 
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[000442] The synthetic ligation reassembly method of the invention utilizes a plurality 
of nucleic acid building blocks, each of which preferably has two ligatable ends. The two 
ligatable ends on each nucleic acid building block may be two blunt ends (i.e. each having an 
overhang of zero nucleotides), or preferably one blunt end and one overhang, or more 
5 preferably still two overhangs. On a double-stranded nucleic acid, a useful overhang can be a 
3' overhang, or a 5* overhang. A nucleic acid building block can have a 3' overhang, a 5' 
overhang, two 3' overhangs, or two 5* overhangs. The overall order in which the nucleic 
acid building blocks are assembled to form a finalized chimeric nucleic acid molecule is 
determined by purposeful experimental design (e.g., by designing sticky ends between 
10 building block nucleic acids based on the sequence of the 5 1 and 3' overhangs) and is not 
random. 

[000443] According to one preferred aspect, a nucleic acid building block is generated 
by chemical synthesis of two single-stranded nucleic acids (also referred to as single- 
stranded oligos) and contacting them together under hybridization conditions so as to allow 

15 them to anneal to form a double-stranded nucleic acid building block. A double-stranded 
nucleic acid building block can be of variable size. The sizes of these building blocks can be 
small or large. Preferred sizes for building block range from 1 base pair (not including any 
overhangs) to 1 00,000 base pairs (not including any overhangs). Other preferred size ranges 
are also provided, which have lower limits of from 1 bp to 10,000 bp (including every integer 

20 value in between), and upper limits of from 2 bp to 100, 000 bp (including every integer 
value in between). 

[000444] According to one aspect, a double-stranded nucleic acid building block is 
generated by first generating two single stranded nucleic acids and allowing them to anneal to 
form a double-stranded nucleic acid building block. The two strands of a double-stranded 

25 nucleic acid building block may be complementary at every nucleotide apart from any that 
form an overhang; thus containing no mismatches, apart from any overhang(s). According to 
another aspect, the two strands of a double-stranded nucleic acid building block are 
complementary at fewer than every nucleotide apart from any that form an overhang. Thus, 
according to this aspect, a double-stranded nucleic acid building block can be used to 

30 introduce codon degeneracy. Preferably the codon degeneracy is introduced using the site- 
saturation mutagenesis described herein, using one or more N,N,GIT cassettes or 
alternatively using one or more N,N,N cassettes. 

Example 15: Assays for Evaluation of Nitrilase Activity and Enantioselectivitv 
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[000445] An assay method amenable to high throughput automation to increase the 
screening throughput both of the discovery and evolution efforts for nitrilases is described. 
The ideal assay is one that permits quantification of both product fomiation or substrate 
conversion and also enantiomeric excess. Two achiral and two chiral colorimetric assays that 
are amenable to high throughput screening were developed. 

Achiral Colorimetic Assays Developed: 

[000446] OPA assay for residual substrate. The OPA assay is Applicable to a-amino or 
a-hydroxy nitrile substrates. The lysis of whole cells is not necessary. These results were 
corroborated by HPLC for 2-chloromandelonitrile and phenyl acetaldehyde cyanohydrin. 
The assay works best with aromatic nitriles. Aliphatic compounds exhibit a linear standard 
curve, fluorescence is reduced, reducing the efficacy of the assay. 

[000447] LDH Assay for quantific ation and ee determination of hvdroxvacid formed. 
The LDH assay is applicable to phenyl lactic acid but not to 2-chloromandelic acid. Use of a 
resazurin detection system increases sensitivity and reduces background. Background 
fluorescence of whole cells was overcome either by centrifugation or heat inactivation prior 
to performing assay. 

[000448] AAO Assay for quantification and ee determination of aminoacid formed. The 
AAO assay is applicable to phenylalanine and (S)-2-amino-6-hydroxy hexanoic acid. The use 
of the Amplex Red detection system increases sensitivity. Cell lysis was shown not be 
necessary. Cells are grown in defined media in order to prevent background fluorescence. 
OPA Assay 

[000449] The ophthalaldehyde (OPA) fluorescence based nitrilase assay is used to 
quantify the amount of a-hydroxynitrile substrate remaining. OPA reacts with the cyanide 
released from the pH controlled decomposition of a-hydroxynitriles to the corresponding 
aldehyde and cyanide to yield a fluorescent, quantifiable product. OPA reacts with the 
cyanide released from the pH controlled decomposition of a-hydroxynitriles to the 
corresponding aldehyde and cyanide to yield the fluorescent l-cyano-2-R benzoisoindole. 
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EX A, 320 nm, EMX 380 nm 
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[000450] Standard curves were established for the following substrates: 2- 
Chloromandelonitrile (CMN, 0.998), Cyclohexylmandelonitrile (CHMN, 0.99), 
Acetophenone aminonitrile (APA, 0.99), and Phenylacetaldehyde cyanohydrin (PAC, 0.97), 
(Figure 5), (R 2 values in parentheses). A standard curve for Phenylglycine (PGN, 0.93) was 
also established. Three of the substrates tested, Dimethylbutanal aminonitrile (DMB) (2- 
amino-4,4-dimethyl pentanenitrile), Hydroxypivaldehyde aminonitrile (HP A) and 
Pivaldehyde aminonitrile (PAH), gave very low fluorescence readings and unreliable results 
under the original assay conditions. For these compounds a number of parameters where 
adjusted, however the fluorescent signal strength of these compounds was not increased by 
these manipulations. 

[000451] In an attempt to increase the fluorescent signal of these three compounds, 
naphthalene dicarboxaldehyde (NDA) was substituted for OPA. Standard curves for PAH, 
HPA and DMB with either OPA or NDA were constructed. To determine sensitivity and 
background fluorescence, a lyophilized nitrilase lysate (SEQ ID NOS:189, 190) with 
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suspected catalytic activity on each of the substrates was added. Hydrolysis was detected in 
three out of four of the compounds. NDA sharply boosted the signal, often by an order of 
magnitude, though this reduced linearity is presumably due to signal saturation. 
[000452] NDA was established as an alternative detection reagent for the aliphatic 
5 compounds. However, it is desirable for the assay to utilize the same detection system for all 
of the substrates since this would facilitate the automated evaluation of multiple nitrilase 
substrates. The current OPA based assay is effective for the analysis of PAC, CMN, CHMN, 
APA, MN and PGN. While standard curves have been developed for the aliphatic 
compounds PAH, HP A, and DMB. 

10 Whole cell optimization 

[000453] The effect of addition of lyophilized nitrilase lysate to the assay components, 
either untreated or heat inactivated, was evaluated. Interfering background fluorescence was 
not observed in either case. The OPA assay was next evaluated and optimized for nitrilase 
activity detection in a whole cell format. Both nitrilase expressing whole cells and iw-situ 

15 lysed cells were evaluated. Lyophilized cell lysates were evaluated alongside their respective 
whole cell clones as controls. For this optimization study, mandelonitrile (MN) was chosen as 
a model substrate. 

[000454] The lyophilized cell lysate of SEQ ID NOS:187, 188 was evaluated alongside 
whole cells expressing SEQ ID NOS: 1 87, 1 88 and in situ lysed cells expressing SEQ ID 

20 NOS:l 87, 188 The addition of whole cells did not affect fluorescence nor result in 
fluorescence quenching. Addition of any of the three cell lysis solutions improved 
permeability (and therefore conversion) of mandelonitrile in the whole cell systems. Three 
cell lysing solutions were evaluated: B-PER (Pierce), BugBuster (Novagen) and CelLytic fi- 
ll (Sigma) and were found not to have a deleterious affect on the OPA assay. The addition of 

25 product a-hydroxyacid or a-aminoacid did not affect detection by the OPA assay. 

[000455] The assay was modified from its original format, which required several liquid 
transfer steps, into a one plate process, where cell growth, nitrile hydrolysis and OPA assay 
reaction occurred in the same microtiter plate. Mandelonitrile was tested using this single 
well format. In this case, the E. colL Gene site-saturation mutagenesis (GSSM™ ) cell host 

30 was evaluated. Three clones were tested: SEQ ID NOS:101, 102, SEQ ID NOS:187, 188, 
and an empty vector, which was used as a control. Hydrolysis was evaluated at four 
timepoints, at 10 and 20 mM, and also with a 0 mM control. In an earlier experiment, clone 
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SEQ ID NOS: 1 87, 1 88 was evaluated against the phenylacetaldehyde cyanohydrin substrate 
(for which this enzyme does not exhibit activity), and no activity was observed. 
[000456] The OPA assay was found to detect the presence of both a-hydroxy and a- 
amino nitrile substrate. Aromatic compounds were readily detectable with the assay, while 
5 aliphatic compounds posed some detection challenges. No background issues were evident 
when using lyophilized cell lysates, in-situ lysed whole cells or unlysed whole cells. The 
assay is amenable to one-plate analysis, where cells are grown, incubated with the substrate, 
and assayed on the same plate: no liquid transfers are required, easing automation. While all 
nitriles tested produced a linear response, aliphatic compounds gave a low fluorescent 
10 response. 

ChiralLDH Assay 

[000457] A spectroscopic system based on lactate dehydrogenase (L-LDH) was 
developed for the analysis of the chiral a-hydroxy acids which are generated by the nitrilase 
catalyzed hydrolysis of cyanohydrins. The hydroxynitrile substrate is not metabolized by the 
15 secondary or detection enzyme and thus starting material does not interfere. Cell lysate 
which is not heat treated results in background activity for the LDH system; however, heat 
inactivation or pelleting of the cell lysates eliminates the background activity. (See Figure 
4.) 

[000458] The activity and enantiomeric specificity of commercially available D- and L- 
lactate dehydrogenases against the nitrilases disclosed herein was evaluated. An LDH was 
identified which is suitable to both D- and L-phenyl lactic acid analysis. An enzyme suitable 
for 2-Chloromandelic acid analysis was not found. The chosen LDH enzymes exhibited 
virtually absolute stereoselectivity. The viability of the assay to detect D- and L-LDH 
produced from PAC using lyophilized cell lysate was established. 
[000459] Originally, three colorimetric dyes were evaluated, all of which are 
tetrazolium salts: NBT (3,3 , -dimethoxy-4,4 , -biphenylene)bis[2,(4-nitrophenyl) -5-phenyl- 
2H]-, chloride) MTT (3-(4,5-dime%ltmazol-2-yl)-2,5-diphenyltetrazoUum bromide) INT 
(2-(4-Iodophenyl)-3-(4-nitrophenyl>-5-phenyl- 2H-tetrazolium chloride). The insolubility of 
the product of these detection system posed an analytical challenge. To address this, another 
tetrazolium salt with a reportedly soluble product, XTT (2,3-Bis-(2-methoxy-4-nitro-5- 
sulfophenyl) -2ff-tetrazolium-5-carboxanilide, was evaluated. While XTT yielded a soluble 
bright red product, the substrate was insoluble which thus effected the same analytical 
challenges. As an alternative to the tetrazolium family of dyes, the dual 
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colorimetric/fluorometric dye resazurin was evaluated. Oxidation of resazurin produces 
resourfin. Both substrate and product are soluble, and the color change can be quantified 
colorimetrically or fluorimetrically, increasing accuracy. Due to the sensitivity of resazurin, 
0.05 mM of lactic acid can be quantified. Optimal results were obtained when using the dye 
in the same range as the substrate, e.g. 0.5 mM resazurin can quantify a range of lactic (and 
analogs) from 0.05 to 0.5, though the best linearity is at the lower end of this scale. Resourfin 
was stable over 28 hours, and had a linear fluorescent response. 
[000460] In the presence of the LDH assay components, lyophilized enzyme gave 
background fluorescence/absorption. To address this problem the lysate was boiled for 10 
minutes and then centrifuged. This resulted in a 90% decrease in background signal. 
Interestingly, both centrifugation alone (5 minutes @ 14.1 rcf) or boiling followed by 
centrifugation (5 minutes @ 100°C) reduced the fluorescence to background levels. Li a 
high-throughput format such as 1536 well plates, spinning would be preferable to boiling, as 
boiling would increase evaporation (8 ul well size) and potentially volatize the nitrile 
substrates. No background signal resulting from growth media (LB and TB and M9) or cell 
lytic solutions (B-PER, CelLytic and BugBuster) was noted. 



Chiral AAO Assay 

[000461] A spectroscopic system based on amino acid oxidase (AAO) was developed 
for the analysis of the chiral a-amino acids which are generated by the nitrilase catalyzed 
hydrolysis of amino nitriles. 
Assay Development and Validation 

[000462] The initial assay validation utilized the 2,2 , -azmo-di-{3-ethylbenzothiazoline- 
6-sulfonic acid (ABTS) detection system as outlined above. However, since the color was not 
stable further investigations utilized the phenol amino antipyrine (PAAP) detection system 
which is analyzed at X max 510nm. Enzymes with suitable activity were found for each 
enantiomer of 4-methyl- leucine, phenylalanine, (S)-2-armno-6-hydroxy hexanoic acid, and 
tert-leucine. The assay is not applicable to methylphenylglycine and does not work well with 
phenylglycine. 

[000463] Standard curves were generated for phenylalanine from 0-15 mM. The curve 
is much more linear when the concentrations remained below 1 mM. The color remains 
stable for several days as long as it is kept in the dark. Three cell lysing solutions Bug Buster 
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(BB), Bacterial Protein Extracting Reagent (BPER), and Cell Lytic Reagent (CLR) were 
added to the standard curve and shown to have no affect on color development. The addition 
of cell lysate (cl) did not exhibit background color formation. Addition of the 
phenylacetaldehyde aminonitrile sulfate (PAS) starting material also showed no effect on 
5 color formation. 

[000464] The AAO system exhibits greater linearity at up to 1 mM substrate. The 
concentration of the AAO enzymes and of the acid substrate were adjusted to try to move the 
intersection of the L-AAO and D-AAO curves closer to the middle of the graph. Premixing 
the PAAP, the HRP, and the AAO was demonstrated to be effective and caused no change in 
10 observed activity establishing that the assay components may be added to the assay in a 
cocktail format. 

[000465] A high level of background was observed for the AAO assay of whole cells 
and this was attributed to the L-amino acids present in the TB and LB growth media. 
Washing and resuspension of the cells in M9 media eliminated background. For all future 
15 experiments cells were grown in M9 media with 0.2% glucose. The lysed cells gave only a 
slightly better response that unlysed cells. Therefore, cell lysis is not necessary. SEQ ID 
NOS: 1 87, 1 88 demonstrated activity on HPA in primary screening based on HPLC analysis. 
[000466} The use of a fluorescent detection system which would permits 
implementation of the assay in ultra high throughput fashion such as 1536 well or gigamatrix 
format was investigated. The fluorescent reagent most applicable to our system is Amplex 
Red from Molecular Probes which produces the highly fluorescent resorufin 545 nm; 
590nm) Standard curves for phenylalanine and (S)-2-amino-6-hydroxy hexanoic acid were 
established (0-100 |aM). 

[000467] In preparation for assay automation, nitrilase expressing cells were added into 
microtiter plate containing M9 0.2% glucose,0.25 mM IPTG media by florescence activated 
cell sorting (FACS). Three nitrilase expressing subclones, and the empty vector control were 
evaluated: SEQ ID NOS:101, 102, SEQ ID NOS:187, 188, SEQ ID NOS:29, 30 and the 
empty vector. The viability of the cells following cell sorting proved to be inconsistent. Thus 
colony picking is currently being evaluated as an alternative method to add cells into 
microtiter plates. The evaporative loss from an uncovered 1536-well microtiter plate is 
approximately 30% per day in the robot incubator (incubator conditions: 37°C at 85% 
relative humidity (RH)). Incubation in the 95% RH incubator reduced evaporative loss to 1% 
per day. 
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[000468] The ability of the three subclones to grow in the presence of up to 3.5 mM of 
nitrile was established using HPA nitrile. Growth rates were only slightly retarded (less that 
30%). Subclones grown in the presence of HPA were shown to express a nitrilase that 
catalyzes the formation of hydroxy norleucine (HNL) as established using the Amplex Red 
detection system. Only S was evaluated as the enzymes are S-selective. The reaction plate 
was read at 10 minute intervals, with 40 minutes showing the best linearity. While cell 
growth is significantly inhibited above 5 mM of HPA when the cells were grown at pH 7, 
growth was inhibited above 0.1 mM HPA for cells grown at pH 8. 
[000469] In order to verify the AAO results by HPLC, a reaction was performed using 
high concentrations of HPA, up to 40 mM (due to HPLC detection challenges for (S)-2- 
amino-6-hydroxy hexanoic acid) and lyophilized cell lysate SEQ ID NOS:187, 188. 



Comparison AAO and HPLC data for HNL 







%ee 


%conversion 


[HNL] 








mM 


AAO 


HPLC 


AAO HPLC 


40 


89% 


100% 


17% 18% 


30 


89% 


97% 


29% 36% 


20 


86% 


97% 


21% 34% 


10 


78% 


98% 


13% 35% 



[000470] • In order to determine if conducting the screen at a lower concentration 
introduces a bias in the results compared to the 20 mM substrate range that was used for 
HPLC based screens, an experiment was performed with SEQ ID NOS:187, 188 using three 
concentration ranges. Each experiment was done in triplicate in order to remove any 
nonsystematic error. 

[000471] The AAO assay can be run on 384 or 1536 well format with cells sorted into 
an M9 0.2% glucose, 0.25 mM IPTG media. Cells can be grown in the presence of nitrile (in 
this case HPA), or the cells can be allowed to reach a certain density and the nitrile can then 
be added. Though cell lytic reagents do not interfere with the assay, when HPA was assayed, 
addition of the lytic reagents was found to be unnecessary. Either pre- or post- nitrile 
addition, the mother plate will have to be split into daughter plates, which are then assayed 
for the respective L- and D- enantiomer content. Incubation times with the AAO/Amplex 
Red reagents can be adjusted so that the D- and L- plate are read at separate times. 
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Example 16: Identification, Development and Production of Robust Novel Enzymes 
Targeted for a Series of High- Value Enantioselective Bioprocesses 
[000472] The invention provides for the development of nitrilases, through directed 
evolution, which provide significant technical and commercial advantages for the process 
5 manufacturing of the following chemical target: 



L-2-amino-6,6-dimethoxyhexanoic acid 



H 2 N^ 



HCN,NH 4 + Me °| H 2 N ~. OMe _7^C00H 




10 



r CN 

MeO. ^~ _ ** /^^\/\ 

„>^ X ^ X ^ CH0 OMe H OMe, 

MeO 

5,5-dimethoxypentanal 5,5-dimethoxypentanal L^-amino-e.e- 

aminonitrile dimethoxyhexanoic 
[000473] acid 

Nitrilase enzymes were shown to hydrolyze 2-amino-6-hydroxy hexanenitrile to (S)- 

2-amino-6-hydroxy hexanoic acid, with selectivity towards the L-enantiomer. The panel of 

nitrilases was screened against the target, 5,5-dimethoxypentanal aminonitrile. The positive 

1 5 enzymes were characterized on this substrate. A primary screen is used to identify putative 

up-mutants, which is then confirmed using EffLC. 

[000474] GSSM™ and GeneReassembly™ are performed on selected nitrilases, in 
order to improve the enantioselectivity and activity of the enzymes for the production of L-2- 
amino-6,6-dimethoxyhexanoic acid. Nitrilases were identified for the enantioselective 

20 hydrolysis of 2-amino-6-hydroxy hexanenitrile to L-(S)-2-amino-6-hydroxy hexanoic acid. 
However, a slight structural difference is presented by the new target molecule, L-2-amino- 
6,6-dimethoxyhexanoic acid. In order to determine whether this difference affects the 
activity and enantioselectivity of the enzymes, the complete spectrum of nitrilases was 
screened against the new target. 

25 [000475] First, identification of the correct target gene for GSSM through more detailed 
characterization of the hit enzymes for the production of I^2-amino-6,6-dimethoxyhexanoic 
acid was carried out. This effort involves a more extensive investigation of the effects of pH 
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and temperature on activity and enantioselectivity and a more in-depth analysis of the 
stability of the enzyme to process conditions. Prior to initiation of the screening, the 
synthesis of a single enantiomer of an alkyl aminonitrile is done; the racemization of this 
nitrile is studied, in an effort to understand the relationship between this factor and 
5 enantioselectivity of the enzymes. 

An enzyme exhibiting the highest combination of activity and enantioselectivity for 
the production of L-2-amino-6,6-dimethoxyhexanoic acid is selected for GSSM. Following 
the mutation of the target enzyme, the resulting mutants are screened on 5,5- 
dimethoxypentanal aminonitrile, using high throughput screening technology. Following 
10 confirmation of the up-mutants by HPLC analysis a decision point is reached, in order to 
evaluate the results of the GSSM on the target. 

[000476] In parallel to GSSM™, a GeneReassembly™ is performed on a combination 
of parent enzymes, at least one of which is selected for activity and enantioselectivity on L-2- 
amino-6,6-dimethoxyhexanoic acid. At least two other nitrilases are reassembled with the 
15 former enzyme(s); these enzymes are selected in order to provide diversity to the reassembled 
sequences. 

[000477] The present invention provides for development of racemization conditions for 
the original substrate aminonitriles. In addition, the present invention provides for the 
identification of enzymes capable of the conversion of these aminonitriles to the target ctr 

20 amino acids by dynamic kinetic resolution. The present invention also provides for 
screening and development of a nitrilase-catalyzed kinetic resolution process for (R)-2- 
amino-6,6-dimethoxy hexanoic acid (allysine) production. (S)-2-amino-6-hydroxy hexanoic 
acid will be used as a model substrate for development of the kinetic resolution. 
The target cc-amino acid products are shown below: 

25 (i) D-4-Fluorophenylglycine 




4-fluorobenzaldehyde 4-fluorophenylglycinonitrife (FPGN) D-4-fiuorophenylglycine 
(ii) L-2-Amino-6,6-dimethoxyhexanoic acid (Allysine) 
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MeO. 




CHO 



HCN, NH 4 * 




OMe NH 2 



nitrilase 




OMe NH 2 



MeO 



MeO' 



CN 



MeO' 



C0 2 H 



5,5-dimethoxypentanal 



5,5-dimethoxypentanal 
aminonitrile (DMPAN) 



L-2-amino-6,6-dimethoxy- 
hexanoic acid 



10 



15 



20 



25 



[000478] Conditions are developed for the racemization of the aminonitrile substrates 
for the nitrilase-catalyzed production of D-4-fluorophenylglycine and 2-amino-4,4-dimethyl 
pentanenitrile (allysine). Two model substrates, phenylglycinonitrile and pentanal 
aminonitrile are used initially, and racemization is studied in the absence of the enzyme. 
Concurrently determination of the performance of one or more available nitrilases under a 
variety of possible racemization conditions is carried out. In addition, the nitrilases are 
screened against hydroxypentanal aminonitrile for the production of (S)-2-amino-6-hydroxy 
hexanoic acid, and the promising enzymes are optimized. Once racemization conditions are 
established, the nitrilases are screened for activity. Further optimization for akinetic 
resolution of the product is performed. 

[000479] A number of enantioselective nitrilases were identified for the hydrolysis of a- 
aminonitriles to ot-amino acids. While these enzymes were shown to have a preference for 
the required enantiomer of certain aminonitriles, a limiting factor in the further screening, 
development and comparison of candidate nitrilases is the rate of racemization of the 
aminonitrile substrates under the reaction conditions. 

Aromatic aminonitrile racemization 

[000480] The first step is to establish conditions under which aromatic aminonitrile 
racemization occurs, using the model substrate, phenylglycinonitrile. Racemization strategies 
include, but are not limited to the list below. Options are roughly prioritized according to 
their commercial applicability. 

(1) Manipulation of the pH of the reaction. Since it has been shown that racemization is rapid at high 
pH, this approach requires the discovery and optimization of nitrilases which are active and selective 
atpH>10. 

(2) Addition of known chemical racemizing agents, such as aldehydes, ketones, weak bases, resins, 
metal ions, Lewis acids etc., which can enhance racemization at lower pH. 

(3) Synthesis of N-acylated aminonitrile derivatives, e.g. N-acetyl phenylglycinonitrile, which may be 
more easily racemized. In the case of N-acetyl phenylglycinonitrile, a selective D-acylase which 
removes the acetyl group would enhance the optical purity of the nitrilase product. 

(4) Use of a biphasic system in which base-catalyzed racemization occurs in the hydrophobic organic 
phase and enzymatic hydrolysis in the aqueous phase. 
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(5) Use of a 2-enzyme system comprised of a nitrilase and an aminonitrile racemase. One amino acid 
racemase is commercially available at present, and will be tested for activity against phenyl- and 
fluorophenylglycinonitrile. Gene libraries will be searched for genes showing homology to known 
amino acid amide racemases, hydantoin racemases or any other racemases which can be identified. 
5 [000481] Once conditions for this racemization have been established, they provide the 
basis for development of conditions for racemization of the target aromatic substrate, 4- 
fluorophenylglycinonitrile (FPGN). The FPGN is expected to be less stable than the model 
substrate; thus, it may racemize more quickly, but degradation reactions may be faster as 
well. The ability of sample enzyme(s) to tolerate and/or function well under them is 
10 evaluated. Final optimization of screening methods include the target substrates, sample 
nitrilases, and substrate racemization conditions. 

[000482] Investigations carried out have shown that phenylglycinonitrile is easily 
racemized at pH 10.8. However, it does not appear that any of the existing enzymes can 
tolerate such harsh conditions of pH. Samples from highly alkaline environments are 
15 screened for the presence of nitrilases which are tolerant to such conditions. Once 

discovered, the enzymes are sequenced and subcloned, and the enzymes are produced as 
lyophilized cell lysates ready for screening. 
Aliphatic aminonitrile racernization 

[000483] A model aliphatic aminonitrile, pentanal arninonitrile, is synthesized in its 
20 racemic form. However optically enriched samples are prepared using one the following 
approaches: (i) preparative chiral HPLC; (ii) diastereomeric salt resolution; (iii) 
diastereomeric derivatization or column chromatography; (iv) synthesis from L-W-BOC 
norleucine. An HPLC assay is used for the detection of these compounds. 
HPLC Assay 

25 [000484] An HPLC assay for the detection of the (S)-2-amino-6-hydroxy hexanoic acid 
is used. An assay involving pre-column derivatization is used. 
Screening/Characterization: 

[000485] Nitrilases are screened against 2-amino-6-hydroxy hexanenitrile. For enzymes 
capable of performing well at greater than 25 mM substrate, scale up reactions are performed. 
30 The substrate/product tolerance and stability profiles of the other enzymes are investigated. 
[000486] The nitrilases are screened, and hits are characterized, focusing on pH and 
temperature optimum, enantioselectivity and stability under the reaction conditions. 
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Enzyme Evolution 

[000487] A target enzyme exhibiting the desired properties is selected for GSSM™. 
Following the mutation of the target enzyme, the resulting mutants are screened on the 
substrate using high throughput screening technology. Once the up-mutants have been 
5 confirmed by HPLC analysis, the individual mutations responsible for increased performance 
may be combined and evaluated for possible additive or synergistic effects. 
[000488] In addition, a GeneReassembly™ will be performed on a combination of lead 
enzymes, which are selected for their desirable characteristics, including activity, 
enantioselectivity and stability in the reaction. 

10 Example 17: Optimization of Nitrilases for the Enantioselective 
Production of (SVPhenvllactic Acid 

[000489] Nitrilases were identified for the enantioselective hydrolysis of 5 different nitrile 
substrates. These nitrilases were isolated and optimized for selected targets. The optimization 
involves process optimization and directed evolution. In particular, enzymes specific for the 
15 production of (S)-phenyllactic acid were characterized and optimized. This was aimed 
primarily at improving the activity of the enzymes, while maintaining a high 
enantioselectivity. An investigation into the effects of process conditions on the enzymes 
was also performed. 

- CN ^\ ^COOH 

"CHO 



HCN 

- 




nitrilase 

0H 




Phenylacetaldehyde Phenylacetaldehyde (S)-Phenyllactic acid 

cyanohydrin 

20 [000490] The development of high throughput assays for screening of mutants from 
potential directed evolution efforts was accomplished. Two achiral and two chiral 
colorimetric assays that are amenable to high throughput screening were developed and used 
for nitrilase directed evolution. 

[000491] SEQ ID NOS:103, 104 was identified as a highly enantioselective nitrilase for 
25 the production of (S)- phenyllactic acid. Characterization of SEQ ID NOS : 1 03, 1 04 shows 
the optimum reaction pH and temperature to be pH 8 and 37°C, respectively; the reaction 
starting material, phenylacetaldehyde, and the product, phenyllactic acid showed no effect on 
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the enzyme activity up to levels of 5 mM and 30 mM, respectively. The scaled-up enzymatic 
reaction with an enantiomeric excess (ee) of 95%. 

Example 18: Directed Evolution of a nucleic acid encoding a nitrilase enzyme. 
[000492] The nitB gene (GenBank Accession No. AX025996, from Alcaligenes 
5 faecalis) was subjected to Gene Site Saturated Mutagenesis™ or GSSM™ to generate a 
library of single amino acid substitution mutants covering the entire enzyme. The sequence 
of the "parental" nitB gene used in the directed evolution is SEQ ID NO: 103, 104. A nitB 
mutant library was generated from carrying out GSSM™. This nitB mutant library was then 
screened for clones with increased whole cell hydroxymethylthiobutryonitrile (HMTBN, 
1 0 which is a nitrilase substrate) activity. The product of the nitrilase reaction on that substrate 
is hydroxymethylthiobutyric acid (HMTBA). 

[000493] Assays were run at 35°C with lOOmM HMTBN and lOOmM K 3 P0 4> pH 7 to 
approximately 30-40% conversion. Two methods were used to quantitate HMTBN 
conversion, one being direct measurement of HMTBS produced by HPLC analysis and the 
15 other being indirect detection of residual HMTBN using the fluorescent cyanide assay, which 
has previously been described. 

[000494] Putative nitB up mutants were subjected to a secondary assay to confirm the 
increased activity. In the secondary assay, up mutants and the wild type control were induced 
in expression medium in shake flasks. Shake flask cultures are then washed with lOOmM 

20 K3PO4, pH7 and resuspended to the same optical density at 660nm. Kinetic assays were then 
performed with the normalized cell resuspensions under the same conditions used in the 
initial assays. Putative up mutants confirmed to have increased HMTBN activity were 
sequenced and tested for increased activity after transformation back into the same expression 
strain to ensure that increases in activity are not due to host mutations. 

25 [000495] A confirmed nitB GSSM™ up-mutant is nitB G46P, which contains a glycine 
(GGT) to proline (CCG) substitution at amino acid 46. The whole cell HMTBN activity of 
this mutant is approximately 50% greater than that of wild type NitB at both 25°C and 35°C. 
Upon identification of the beneficial G46P mutation, GSSM™ was used again to generate a 
pool of double mutants using the nitB G46P template. These mutants all contain the G46P 

30 mutation and an additional single amino acid substitution at a random site. The double 

mutants were assayed for HMTBN activity greater than that of nitB G46P. Double, triple and 
quadruple mutants were created in order to speed up the mutation process and identify 
beneficial mutations more quickly. After the first few beneficial mutations were identified 

130 



WO 03/097810 PCT/US03/15712 
and isolated, they were combined to generate double mutants, the best of which was DM18. 
DM18 was used as a template to generate triple mutants. The most active triple mutant was 
TM3 and that was used as a template to generate quadruple mutants. The most active 



quadruple mutant was QM2. The table summarizes these mutations. 



mutant 


mutation 1 


mutation 2 


mutation 3 


mutation 4 


DM18 


R(gcg)29C(tgt) 


Y(tac)207M(atg) 






TM3 


R(gcg)29 C(tgt) 


Y(tac)207M(atg) 


L(ctt) 170 T(act) 




QM2 


R(gcg)29C(tgt) 


Y(tac)207M(atg) 


L(ctt) 170 T(act) 


A(gcg)197N9(aat) 



5 

[000496] The mutants were characterized first by studying their whole cell HMTBN 
activity. At 100 mM HMTBN, the HMTBS production rate of QM2 is 1.2 times greater than 
that of the parental gene. However, at 200 mM HMTBN, the rate of QM2 is 3.6 times that of 
the parental gene. The productivity of these mutants is increased considerably when the 



10 HMTBN concentration is raised from 100 mM to 300 mM. As to conversion rates, TM3 
completely converted the substrate after 270 minutes and both DM18 and SM show greater 
than 75% conversion after this time. To further address the issue of HMTBN concentration 
effects on activity/productivity of NitB, several mutants were assayed at both 400 mM and 
528 mM HMTBN. NitB is essentially inactive at these substrate concentrations, however the 

15 mutants retain significant activity at these concentrations. In particular, the activity at these 
high concentrations were essentially the same as their activity at 200 mM substrate. 
Therefore, the mutants can be used over a wide substrate concentration range and provide 
much more flexibility in utility than the NitB parental gene. 

[000497] The mutants were shown to have higher expression levels than the parental 
20 gene and it also appeared tha the QM2 and TM3 mutants contained a greater proportion of 
soluble enzyme than the wild type as seen in SDS-PAGE analysis. As to stability, all of the 
enzymes showed essentially the same stability pattern at both 25°C and 35°C. 
[000498] Finally, the mutants were subjected to codon-optimization. The approach was 
to optimize the codons and therefore increase the expression levels in the particular host cell. 
25 That would, in turn, increase the activity per cell of the enzyme. This resulted in increased 
whole cell activity in the codon-optimized mutants as compared to controls. The increase in 
activity was approximately double the activity. An E. eoli expression system was used. 

Example 19:Selected Examples of Compounds Produced From aNitrilase-Catalvzed 
Reaction 
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[000499] The compounds listed in Figure 1 5 are selected compounds that can be produced 
from a nitrilase-catalyzed reaction using an enzyme and/or a method of the invention. 
[000500] In addition, the following are potential products which can be made via the 
nitrilase Strecker format. More than 100 amino acids and many new drugs can be produced 
from their respective aldehydes or ketones utilizing the nitrilase enzymes of the invention. 
For example, large market drugs which can be synthesized using nitrilases of the invention 
include homophenylalanine, VASOTEC™, VASOTERIC™, TECZEM™, PRINWIL™, 
PRINZDDE™, ZESTRIL™, ZESTORETIC™, RAMACE™, TARKA™, MAVIK™, 
TRANDOAPRDL™, TRANDOLAPRILAT™, ALTACE™, ODRIK™, UNIRETIC™, 
LOTENSIN™, LOTREL™, CAPOTEN™, MONOPRIL™, TANATRIL™, ACECOL™, 
LONGES™, SPIRAPRIL™, QUINAPRIL™, and CILAZAPRIL™. Other chiral drugs 
include DEMSER™ (alpha-methyl-I/Tyrosine), ALDOCHLOR™, LEVOTHROID™ 
SYNTHROID™, CYTOMEL™, THYOLAR™, HYCODAN™, CUPRIMINE™, 
DEPEN™, PR3MAXIN™, MIGRANOL™, D.H.E.-45, DIOVAN™, CEFOBID™, L~ 
DOPA, D-DOPA, D-alpha-methyl-DOPA, L-alpha-methyl-DOPA, L-gamma- 
hydroxyglutamate, D-gamma-hydroxyglutamate, 3-(2-naphthyl)-L-alanine, D-homoserine, 
and L-homoserine. 

[000501] Furthermore, the nitrilase enzymes of the invention can be useful for synthesizing 
the following amino acids. Many of these amino acids have pharmaceutical applications. D- 
phenylglycine, L-phenylglycine, D-hydroxyphenylglycine, L-hydroxyphenylglycine, L- 
tertiary leucine, D-tertiary leucine, D-isoleucine, L-isoleucine, D-norleucine, L-norleucine, 
D-norvaline, L-norvaline, D-2-thienylglycine, L-2-thienylglycine, L-2-aminobutyrate, D-2- 
aminobutyrate, D-cycloleucine, L-cycloleucine, D-2-methylphenylglycine, L-2- 
methylphenylglycine, L-thienylalanine, and D-thienylalanine. 
[000502] The enzymes of the nitrilase enzymes of the invention can be useful for the 
synthesis of the following natural amino acids: glycine, L-alanine, L-valine, L-leucine, L- 
isoleucine, ^phenylalanine, L-tyrosine, L-tryptophan, L-cysteine, L-methionine, L-serine, D- 
serine, L-threonine, L-lysine, L-arginine, L-histidine, L-aspartate, L-glutamate, L-asparagine, 
L-glutamine, and L-proline. The following are examples of unnatural amino acids which can 
be produced using the nitrilase enzymes of the invention. D-alanine, D-valine, D-leucine, D- 
isoleucine, D-phenylalanine, D-tyrosine, D-tryptophan, D-cysteine, D-methionine, D- 
threonine, D-lysine, D-arginine, D-histidine, D-aspartate, D-glutamate, D-asparagine, D- 
glutamine, and D-proline. 
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[000503] Furthermore, nitrilase enzymes of the invention can be used in non-Strecker 
chemical reactions including the synthesis of more chiral drugs such as TAXOTERE™ as 
well as chiral drugs containing 3-hydroxy-glutaronitrile (a $5.5B market); LMTOR™, 
BAYCOL™, and LESCOL™. Chiral product targets that are not drugs include 
5 PANTENOL™, L-phosphinothricin, D-phosphinothricin, D-fluorophenylalanine, and L- 
fluorophenylalanine. Finally, nitrilase can be used to produce unnatural amino acid 
compounds lacking a chiral center such as sarcosine, iminodiacetic acid, EDTA, alpha- 
aminobutyrate, and beta-alanine. 

Figure 16 examples of substrates and products produced by the nitrilases of the 
1 0 invention and/or the methods of the invention. The chemical structures of the substrates and 
of the products are shown. The chemical reactions shown here are non-limiting examples of 
activities of the nitrilases of the invention. 

Example 20: Exemplary Preparation Using a Polypeptide of a Variant of SEP ID NO:210 
[000504] The variant, nitrilase 1506-83-H7A, is SEQ ID NO:210 with the Ala at residue 
15 190 replaced with His. At the codon level, the mutation that occurred was GCT to CAT. 
This variant exhibits improved enantioselectivity in the conversion of 3- 
hydroxyglutarylnitrile (HGN) to (R)-4-Cyano-3-hydroxybutyrate. 

[000505] This variant has been demonstrated to perform this transformation in 100 mM pH 
7 sodium phosphate buffer at room temperature. This mutant can perform in other buffer 

20 systems and temperatures as well with the potential for providing additional altered 

properties. Exemplary properties include, but are not limited to, altered rates of the reaction, 
% ee, and stability. In particular, the altered properties can be a higher reaction rate, a higher 
% ee, and greater stability. Altered properties can be an increase or decrease of at least 25%, 
30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, or 95% more than 

25 wildtype. 

[000506] This variant was shown to perform the transformation by producing products in 
high enantiomeric excess of 10 mM to 3 M substrate (HGN). Higher or lower substrate 
concentrations are also possible. Enantiomeric excess greater than or equal to 95% have been 
achieved. However, enantiomeric excess can be at least 25%, 30%, 35%, 40%, 45%, 50%, 
30 55%, 60%, 65%, 70%, 75%, 80%, 85%, or 90% more than wildtype. 

[000507] Variants of the SEQ ID NOs: of the invention, can be cloned into expression 
vectors. For example, variants of nucleic acid sequence SEQ ID NO:195, 205, 207, 209, OR 
237, and nucleotides that encode the variants of amino acid sequence SEQ ED NO:210 can 
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be cloned into exemplary vectors that include, but are not limited to, pSE420 (E. coli 
expression vector) and pMYC (pseudomonas expression vector). 

Example 21 : Preparation using variants of the invention : 

[000508] Add 3-Hydroxyglutaronitrile (1 g, 9 mmol) drop-wise to a stirred solution of 
nitrilase cell lysate (normalized for 150 mg protein content) in 2.12 mL of 100 mM pH 7 
sodium phosphate buffer at room temperature, -22 °C. Stir this 3 M reaction by magnetic stir 
bar for 24 hours at room temperature. Monitor the progress of the reaction by TLC (Thin 
Layer Chromatography) and GC (Gas Chromatography). The reaction should be complete 
within 24 hours. 

[000509] Other variants contemplated herein include, but are not limited to the following: 
Nl 1 IS; A190H, S, Y or T; F191L, V, M, D, G, E, Y or T; M199E, orL; D222L; A55K, G, or 
Q; I60E, or any combination thereof. 

Example 22: Screening Assay for Enantioselective Transformation 
[00051 0] A new method to screen for enantioselective transformation, for example, of a 
prochiral substrate into a chiral one that affords the ability to monitor enantiomeric excess (% 
ee) of the resultant product is disclosed. This approach can also be applied to detennine 
diasteromeric excess (% de). 

[00051 1] For example, by labeling one of the two prochiral or enantiotopic moieties in a 
molecule, for example by the use of a heavier or liter isotope, the modification of one of the 
two moieties by a selective catalyst, for example, an enzyme, can be established by mass 
spectroscopy (MS). 

[000512] By performing the exemplary nitrilase reaction on 15 N-(R)-HGN (R) (as shown in 
Figure 17) or 15 N-(S)-HGN, one can determine the enantioselectivity of the enzyme by 
analyzing the amount of each of the two possible labeled versus unlabeled acid products 
which can be fontned. 



15, 



OH 





OH 

"NCs^A^COOH FM= 130.04 
*N (S)-4-cyano-3-hydroxy butyric acid 



15, 
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[000513] The screening experiment may be performed in either direction. The screening 
experiment can be used for both the 15N-(R)-and (S)-HGN moieties. In fact, to ensure that 
the label does not effect any artifactual changes, at the onset, both should be investigated. 



OH 

^NCs^Jk^-CN 15n ( R ^^ yano-3 _ hydroxy butyric acjd 



OH SSst 601 - ^O^COOH FM = 130.04 



OH 

FM = 111.05 pros selective^ M ~ T ^ 

nitrilase NC Vs ^^COOH FM = 129.04 

(S)-4-cyano-3-hydroxy butyric acid 



[000514] To equate the observed enantiomeric excess resulting from the nitrilase 
transformation, the following exemplary formula may be applied: 

% ee - {[130]- [129]}/ {[130] + [129]}, where each concentration of the light acid 
(129) and the heavy acid (130) are determined by correlation of the peak area on the mass 
spectrometer to a standard curve or by direct comparison of the areas of each of the 129 and 
130 mass peaks. The actual mass units used to determine the relative amounts of each of the 
two enantiomers (labeled and unlabelled) are dependent on how the mass spectrometer is 
tuned. 

[0005151 In some cases, the % ee observed by mass spectrometry may differ by a factor 
from that observed by an alternate analytical technique such as liquid chromatography due to 
background or contaminating peaks resulting from natural isotopic abundance. This does not, 
however, affect the final outcome of the screening process. Exemplary strandard curves for 
quantification of heavy acid and light acid are shown in Figures 14 A and B. 
[000516] The following reaction is a possible synthetic route to prepare, for example, 1 5N 
(R)-HGN using chemistry techniques known in the art with commercially available starting 
materials. 

OH KC 15 N OH 

[000517] The amount of each of the two possible stereomeric outcomes can be established 
by the use of MS in either positive mode, negative mode and from analysis of either of the 
parental mass or of any fragmentation mass. 
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Example 23: Stability and activity of exemplary enzymes of the invention 
Enzyme stability : 

Wild-type enzyme (SEQ ID NOS:209 and 210) was compared to mutant A190H of SEQ ID 
NOS:209 and 210. In the experiment, each enzyme was incubated at 10 mg/ml in water for 
1, 25, 50, 75 and 150 hours at 4°C and at 21 °C, on two different substrates: adiponitrile and 
hydroxyglutaryl nitrile. Both enzymes, in all conditions, were found to retain activity for 150 
hours. The wild-type enzyme showed greater activity on adiponitrile, while the mutated 
(A190H) enzyme showed greater activity on hydroxyglutaryl nitrile, as assessed by the 
Nitroprusside Bertholet assay (see, e.g., Fawcett, J. K. & Scott, J. (1960); J. Clin. Path.; Vol. 
13,pgl56). 



GSSM ,ra Variant of 


lOOmM 


2.25 mM 


Time to 


SEQE>NOS:209and 


hydroxyglutaryl 


hydroxyglutaryl 


completion 


210 


nitrile 


nitrile 


(hours) 




ee% 


ee% 


A55G 


96.5 ± 0.4 


Not determined 


>160 


A55K 


94.7 ± 0.2 


Not determined 


>160 


I60E 


96.5 ± 0.5 


Not determined 


>160 


Nil IS 


95.8 ± 0.5 


96.1 ± 0.9 


>160 


A190T 


96.5 ± 0.2 


96.6 ±0.4 


40 


A190S 


96.8 ± 0.2 


95.5 ±0.7 


40 


A190H 


97.9 ± 0.1 


98.1 ±0.1 


15 


F191L 


97.9 ±0.1 


Not determined 


>160 


F191T 


97.9 ±0.1 


Not determined 


>160 


F191M 


97.9 ±0.1 


Not determined 


>160 


F191V 


97.9 ±0.1 


Not determined 


>160 


M199E 


97.9 ± 0.1 


Not determined 


160 


M199L 


97.9 ± 0.1 


95.4 ±0.1 


>160 


Wild type SEQ ID 


94.5 ± 0.1 


87.8 ± 0.2 


24 


NOS:209and210 









GSSM mutants with enhanced enantioselectivity 



1 00 mM reactions were performed with nitrilase expressed from E. coli in whole cell fonnat 
and were complete with 36 hours. 2.25 M reactions were performed with nitrilase as 
lyophilized clarified cell lysate. All % ee data reported are the average of three 
measurements, with standard deviation of the mean. The time for reaction completion was 
approximated by TLC. 

Specifically: 
Nitrilase Activity Assay, 100 mM HGN : 
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Putative nitrilase up-mutants were assayed in triplicate. Each transfonnant was grown in 5 
mL LB (100 ^g/mL ampicillin), at 37 °C, 220 rpra for 1 8 h. The overnight culture was diluted 

2- fold and nitrilase expression induced at 37 °C, 220 rpm with 0.1 mM IPTG for 6 h. Cells 
were harvested by centrifugation, washed in 100 mM pH 7 sodium phosphate buffer and then 
re-suspended in 1 mL of 100 mM HGN in 100 mM pH 7 sodium phosphate buffer. Reactions 
were allowed to proceed for at least 36 h at 22 °C with gentle agitation. Reaction progress 
was monitored by TLC (1:1 EtOAc:Hexanes, RfH).5, nitrile; Rf=0.0, acid). Cells and other 
debris were removed by centrifugation and the treated with one volume methanol prior to 
lyophilization. The lyophilizate was re-suspended in methanol and treated with TMS- 
diazomethane (10 equivalents, 2 M solution in hexanes) until gas evolution ceased and 
yellow color persisted in order to prepare the methyl ester for GC analysis. Selected nitrilase 
variants producing (R)-(-)-3-hydroxy-4-cyanobutyric acid of 95% ee or greater were then 
evaluated for performance at 2.25 M HGN. 

Nitrilase Activity Assay at 2.25 M 3-HGN: 

3- HGN (0.2 g, 1.8 mmol, 3 M) was suspended in sodium phosphate buffer (0.6 mL, pH 7, 
100 mM) at 22 °C. Cell lysate (6 mg, normalized for nitrilase content) was added to bring the 
concentration to 11 mg/ml enzyme and the reaction shaken (100 rpm, 22 °C). Reaction 
progress was monitored by TLC (1:1 ethyl acetate:hexanes, Rf = 0.32, nitrile; Rf = 0.0, acid) 
The reaction mixture was treated with one part methanol prior to lyophilization. The 
lyophilizate was re-suspended in methanol and treated with 10 equivalents of TMS- 
diazomethane (10 equivalents, 2 M solution in hexanes) to prepare the methyl ester and 
analyzed by GC. 

Description of novel high throughput LC/MS method for screening high numbers of samples: 
Ultra High-throughput Primary Chiral Activity Screen: 

Distinct members of the GSSM library were arrayed into 384 well plates 
containing 40 \xL of (Luria-Bertani) LB medium (100 ng/mL ampicillin) via an automated 
colony picker and then incubated at 37 °C, 85% humidity. Nitrilase expression was induced 
with 0.1 mM IPTG at 37 °C for 24 h. Each plate was replicated and 20% glycerol stocks 
prepared for archival at -80 °C. To each 384 well plate was added 10 mM 15N-(R)-1 
substrate. The plates were incubated at 37 °C, 85% humidity for three days. Cells and other 
debris were removed by centrifugation and the supernatant was diluted 17,576-fold prior to 
MS analysis. 
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LC/MS ionspray was applied for high through-put analysis in the following 
manner. High-throughput screening was achieved by flow injecting samples from 384-well 
plates using a CTCPAL autosampler (Leap Technologies, Carrboro, N.C.). An an isocratic 
mixture of 71% acetonitrile, 29% water, with 0.1% formic acid, provided by LC-lOADvp 
pumps (Shimadzu, Kyoto, Japan) at 2.2 mL/min through an LC-18 cartridge (Supelco, 
Bellefonte, PA) was used. Samples were applied to an API 4000 Turbolon spray triple- 
quadrupole mass spectrometer (Applied Biosystems, Foster City, CA). Ion spray and 
Multiple Reaction Monitoring (MRM) were performed for analytes in the negative ion mode, 
and each analysis took 60 seconds. 

E. coli transformed with wild type enzyme (SEQ ID NOS:209 and 210) was 
used as a positive activity control and E. coli transformed with empty vector was used as the 
negative activity control. The % ee of the WT enzyme positive control determined by mass 
spectrometry using either 15N-(R)-1 or 15N-(S)-1 were the same, thus demonstrating the 
absence of a significant isotope effect. 



Temp (°C) 


pH 


Sodium phosphate buffer 
cone. (raM) 


%ee 


Std. Dev. 


4 


7 


100 


98.7 


0.1 % 


19 


7 


100 


98.7 


0.1 % 


21 


7 


100 


98.6 


0.1 % 


37 


7 


100 


98.4 


0.1 % 


21 


7 


100 


98.6 


0.1 % 


21 


6 


100 


98.6 


0.1 % 


21 


8 


100 


98.6 


0.1 % 


21 


7 


100 


98.5 


0.1 % 


21 


7 


50 


98.6 


0.1 % 


21 


7 


25 


98.7 


0.1 % 



Effect of reaction parameters on SEQ ID NOS:209 and 210 with the A190H mutation. 



Reactions were performed at 3 M HGN concentration with 150 mg/ml protein 
(-49 mg/ml enzyme). % ee was determined by GC analysis in triplicate runs. 



[00051 8] While the invention has been described in detail with reference to certain 
preferred aspects thereof, it will be understood that modifications and variations are within 
the spirit and scope of that which is described and claimed. 
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What is claimed is: 

1 . An isolated or recombinant nucleic acid comprising nucleotides having a sequence at 
least 50% identical to SEQ ID NO: 195, 205, 207, 209, or 237, variants of SEQ ID 

5 NO:195, 205, 207, 209, or 237, having one or more mutations: at positions 163-165 

AAA, AAG, GGT, GGC, GGA, GGG, CAA, or CAG; at positions 178-180 GAA or 
GAG; at positions 33 1 -333 TCT, TCC, TCA, TCG, AGT, or AGC; at positions 568- 
570 CAT, CAC, TCT, TCC, TCA, TCG, AGT, AGC, ACT, ACC, ACA, TCA, TAT, 
TAC, ATG or ACG; at positions 571-573 TTA, TTG, CTT, CTC, CTA, CTG, GTT, 
10 GTC, GTA, GTG, ATG, ACT, ACC, ACA, GAT, GAC, GGT, GGC, GGA, GGG, 

GAA, GAG, TAT, TAC, or ACG; at positions 595-597 GAA, GAG, TTA, TTG, CTT, 
CTC, CTA, or CTG; at positions 664-666 TTA, TTG, CTT, CTC, CTA, or CTG; or any 
combination thereof, fragments thereof, wherein the nucleic acid or fragment encodes a 
polypeptide having nitrilase activity, or their complements. 

15 

2. The isolated or recombinant nucleic acid of claim 1 , wherein the nucleic acid comprises 
nucleotides having a sequence substantially identical to the SEQ ID NO: 195, 205, 207, 
209, or 237, or variants of SEQ ID NO: 195, 205, 207, 209, or 237, having one or more 
mutations: at positions 163-165 AAA, AAG, GGT, GGC, GGA, GGG, CAA, or CAG; 

20 at positions 178-1 80 GAA or GAG; at positions 33 1-333 TCT, TCC, TCA, TCG, AGT, 

or AGC; at positions 568-570 CAT, CAC, TCT, TCC, TCA, TCG, AGT, AGC, ACT, 
ACC, ACA, TCA, TAT, TAC, ATG or ACG; at positions 571-573 TTA, TTG, CTT, 
CTC, CTA, CTG, GTT, GTC, GTA, GTG, ATG, ACT, ACC, ACA, GAT, GAC, GGT, 
GGC, GGA, GGG, GAA, GAG, TAT, TAC, or ACG; at positions 595-597 GAA, 

25 GAG, TTA, TTG, CTT, CTC, CTA, or CTG; at positions 664-666 TTA, TTG, CTT, 

CTC, CTA, or CTG; or any combination thereof, or their complements. 

3 . An isolated or recombinant nucleic acid comprising nucleotides having a sequence 
identical to the SEQ ID NO:195, 205, 207, 209, or 237, fragments having nitrilase 

30 activity, or their complements. 



An isolated or recombinant nucleic acid comprising nucleotides having a sequence 
identical to a variant of SEQ ID NO:195, 205, 207, 209, or 237, having one or more 
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mutations: at positions 163-165 AAA, AAG, GGT, GGC, GGA, GGG, CAA, or CAG; 
at positions 178-180 GAA or GAG; at positions 331-333 TCT, TCC, TCA, TCG, AGT, 
or AGC; at positions 568-570 CAT, CAC, TCT, TCC, TCA, TCG, AGT, AGC, ACT, 
ACC, ACA, TCA, TAT, TAC, ATG or ACG; at positions 571-573 TTA, TTG, CTT, 
5 CTC, CTA, CTG, GTT, GTC, GTA, GTG, ATG, ACT, ACC, ACA, GAT, GAC, GGT, 

GGC, GGA, GGG, GAA, GAG, TAT, TAC, or ACG; at positions 595-597 GAA, 
GAG, TTA, TTG, CTT, CTC, CTA, or CTG; at positions 664-666 TTA, TTG, CTT, 
CTC, CTA, or CTG; or any combination thereof, fragments having nitrilase activity, or 
their complements. 

10 

5. An isolated or recombinant nucleic acid that hybridizes to a nucleic acid of SEQ ID 
NO: 195, 205, 207, 209, or 237, or variants of SEQ ID NO: 195, 205, 207, 209, or 237, 
having one or more mutations: at positions 163-165 AAA, AAG, GGT, GGC, GGA, 
GGG, CAA, or CAG; at positions 178-180 GAA or GAG; at positions 331-333 TCT, 

1 5 TCC, TCA, TCG, AGT, or AGC; at positions 568-570 CAT, CAC, TCT, TCC, TCA, 

TCG, AGT, AGC, ACT, ACC, ACA, TCA, TAT, TAC, ATG or ACG; at positions 
571-573 TTA, TTG, CTT, CTC, CTA, CTG, GTT, GTC, GTA, GTG, ATG, ACT, 
ACC, ACA, GAT, GAC, GGT, GGC, GGA, GGG, GAA, GAG, TAT, TAC, or ACG; 
at positions 595-597 GAA, GAG, TTA, TTG, CTT, CTC, CTA, or CTG; at positions 

20 664-666 TTA, TTG, CTT, CTC, CTA, or CTG; or any combination thereof, fragments 

having nitrilase activity, or their complements. 

6. The isolated or recombinant nucleic acid of claim 5, wherein the stringent conditions 
comprise at least 50% formamide, and about 37°C to about 42°C. 

25 

7. A nucleic acid probe comprising from about 15 nucleotides to about 50 nucleotides, 
wherein at least 15 consecutive nucleotides are at least 50% complementary to a nucleic 
acid target region within a nucleic acid sequence of SEQ ID NO: 195, 205, 207, 209, or 
237, or variants of SEQ ID NO: 195, 205, 207, 209, or 237, having one or more 

30 mutations: at positions 163-165 AAA, AAG, GGT, GGC, GGA, GGG, CAA, or CAG; 

at positions 178-180 GAA or GAG; at positions 331-333 TCT, TCC, TCA, TCG, AGT, 
or AGC; at positions 568-570 CAT, CAC, TCT, TCC, TCA, TCG, AGT, AGC, ACT, 
ACC, ACA, TCA, TAT, TAC, ATG or ACG; at positions 571-573 TTA, TTG, CTT, 
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CTC, CTA, CTG, GTT, GTC, GTA, GTG, ATG, ACT, ACC, ACA, GAT, GAC, GGT, 
GGC, GGA, GGG, GAA, GAG, TAT, TAC, or ACG; at positions 595-597 GAA, 
GAG, TTA, TTG, CTT, CTC, CTA, or CTG; at positions 664-666 TTA, TTG, CTT, 
CTC, CTA, or CTG; or any combination thereof, or their complements. 

8. A nucleic acid probe comprising at least 1 5 consecutive nucleotides of a nucleic acid 
target region within a nucleic acid sequence of SEQ ID NO: 195, 205, 207, 209, or 237, 
or variants of SEQ ID NO:195, 205, 207, 209, or 237, having one or more mutations: at 
positions 163-165 AAA, AAG, GGT, GGC, GGA, GGG, CAA, or CAG; at positions 
178-180 GAA or GAG; at positions 331-333 TCT, TCC, TCA, TCG, AGT, or AGC; at 
positions 568-570 CAT, CAC, TCT, TCC, TCA, TCG, AGT, AGC, ACT, ACC, ACA, 
TCA, TAT, TAC, ATG or ACG; at positions 571-573 TTA, TTG, CTT, CTC, CTA, 
CTG, GTT, GTC, GTA, GTG, ATG, ACT, ACC, ACA, GAT, GAC, GGT, GGC, 
GGA, GGG, GAA, GAG, TAT, TAC, or ACG; at positions 595-597 GAA, GAG, TTA, 
TTG, CTT, CTC, CTA, or CTG; at positions 664-666 TTA, TTG, CTT, CTC, CTA, or 
CTG; or any combination thereof, or their complements. 

9. A nucleic acid vector capable of replication in a host cell, wherein the vector comprises 
the nucleic acid of any one of claims 1 to 6, 12, or 13. 

10. A host cell comprising the nucleic acid of any one of claims 1 to 6, 12 or 13. 

1 1. A host organism comprising the host cell of claim 10. 

12. An isolated or recombinant nucleic acid encoding a polypeptide comprising amino acids 
having a sequence at least 50% identical to SEQ ID NO:196, 206, 208, 210 or 238, or 
variants of SEQ ID NO: 196, 206, 208, 210 or 238, having one or more mutations: at 
residue 55 lysine, glycine, or glutamine; at residue 60 glutamic acid; at residue 1 1 1 
serine, at residue 190, serine, histidine, tyrosine or threonine; at residue 191, leucine, 
valine, methionine, aspartic acid, glycine, glutamic acid, tyrosine or threonine; at 
residue 199 glutamic acid or leucine; at residue 222 leucine; or any combination 
thereof, fragments encoding polypeptides wherein the polypeptides have nitrilase 
activity, or its complement. 
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13. An isolated or recombinant nucleic acid encoding a polypeptide comprising amino acids 
having a sequence of SEQ ID NO:196, 206, 208, 210 or 238, or variants of SEQ ID 
NO: 196, 206, 208, 210 or 238, having one or more mutations: at residue 55 lysine, 

5 glycine, or glutamine; at residue 60 glutamic acid; at residue 111 serine, at residue 190, 

serine, histidine, tyrosine or threonine; at residue 191, leucine, valine, methionine, 
aspartic acid, glycine, glutamic acid, tyrosine or threonine; at residue 199 glutamic acid 
or leucine; at residue 222 leucine; or any combination thereof, fragments encoding a 
polypeptides having nitrilase activity, or its complement. 

10 

14. The isolated or recombinant nucleic acid of any one of claims 1 to 6, 12 or 13, wherein 
the nucleic acid is affixed to a solid support. 

15. The isolated or recombinant nucleic acid of claim 14, wherein the solid support is 

15 selected from the group of a gel, a resin, a polymer, a ceramic, a glass, a microelectrode 

and any combination thereof. 

16. An isolated or recombinant polypeptide comprising amino acids having a sequence at 
least 50% identical to SEQ ID NO:196, 206, 208, 210 or 238, or variants of SEQ ID 

20 NO:196, 206, 208, 210 or 238, having one or more mutations: at residue 55 lysine, 

glycine, or glutamine; at residue 60 glutamic acid; at residue 111 serine, at residue 190, 
serine, histidine, tyrosine or threonine; at residue 191, leucine, valine, methionine, 
aspartic acid, glycine, glutamic acid, tyrosine or threonine; at residue 199 glutamic acid 
or leucine; at residue 222 leucine; or any combination thereof, or fragments thereof, 

25 wherein the polypeptide has a nitrilase activity. 

17. An isolated or recombinant polypeptide comprising amino acid having SEQ ID 
NO:196, 206, 208, 210 or 238, or variants of SEQ ID NO:196, 206, 208, 210 or 238, 
having one or more mutations: at residue 55 lysine, glycine, or glutamine; at residue 60 

30 glutamic acid; at residue 1 1 1 serine, at residue 190, serine, histidine, tyrosine or 

threonine; at residue 191, leucine, valine, methionine, aspartic acid, glycine, glutamic 
acid, tyrosine or threonine; at residue 199 glutamic acid or leucine; at residue 222 
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leucine; or any combination thereof, or fragments thereof, wherein the polypeptide has 
nitrilase activity. 



18. The isolated or recombinant polypeptide of any one of claims 16 or 17, wherein the 
fragment is at least 20 amino acids in length, and wherein the fragment has nitrilase 
activity. 

19. A peptidomimetic of the polypeptide of claim 16 or claim 17 or fragments thereof 
having a nitrilase activity. 

20. A codon-optimized polypeptide of the polypeptide of claim 16 or claim 17, or 
fragments thereof, having a nitrilase activity, wherein the codon usage is optimized for 
a particular organism or cell. 

21. The polypeptide of claim 16 or claim 17 or fragments thereof, having a nitrilase 
activity, or a peptidomimetic thereof having a nitrilase activity, wherein the 
polypeptide, fragment, or peptidomimetic is affixed to a solid support. 

22. The polypeptide of claim 21, wherein the solid support is selected from the group 
consisting of a gel, a resin, a polymer, a ceramic, a glass, a microelectrode and any 
combination thereof. 

23. A purified antibody that specifically binds to the polypeptide of claim 16 or claim 17 or 
fragments thereof, having a nitrilase activity. 

24. A fragment of the antibody of claim 23, wherein the fragment specifically binds to a 
polypeptide having a nitrilase activity. 

25. An enzyme preparation which comprises at least one of the polypeptides of any one of 
claims 16 and 17, wherein the preparation is liquid or dry. 

26. The enzyme preparation of claim 25, wherein the preparation is affixed to a solid 
support. 
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27. A composition comprising at least one nucleic acid of claims 1 to 6, 12, or 13 or 
comprising at least one polypeptide of claim 16 or claim 17 or fragments thereof, or a 
peptidomimetic thereof, having nitrilase activity, or any combination thereof. 

5 

28. A method for hydrolyzing a nitrile to a carboxylic acid comprising contacting the 
molecule with at least one polypeptide of claim 16 or claim 17 or fragments thereof, or 
a peptidomimetic thereof, having nitrilase activity, under conditions suitable for 
nitrilase activity. 

10 

29. A method for hydrolyzing a cyanohydrin moiety or an aminonitrile moiety of a 
molecule, the method comprising contacting the molecule with at least one polypeptide 
of any one of claim 16 or claim 17 or fragments thereof, or a peptidomimetic thereof, 
having nitrilase activity, under conditions suitable for nitrilase activity. 

15 

30. A method for making a chiral alpha-hydroxy acid molecule or a chiral amino acid 
molecule, the method comprising admixing a molecule having a cyanohydrin moiety or 
an aminonitrile moiety with at least one polypeptide having an amino acid sequence at - - 
least claim 16 or claim 1 7 or fragments thereof, or a peptidomimetic thereof, having 

20 enantio-selective nitrilase activity. 

31. A method for making a composition or an intermediate thereof, the method comprising 
admixing a precursor of the composition or intermediate, wherein the precursor 
comprises a cyanohydrin moiety or an aminonitrile moiety, with at least one 

25 polypeptide of any one of claim 16 or claim 17 or fragments thereof or peptidomimetic 

thereof having nitrilase activity, hydrolyzing the cyanohydrin or the aminonitrile moiety 
in the precursor thereby making the composition or the intermediate thereof. 

32. A method for making an (T^-ethyl 4-cyano-3-hydroxybutyric acid, the method 

30 comprising contacting a hydroxyglutaryl nitrile with at least one polypeptide encoded 

by a nucleic acid having a sequence of SEQ ID NO:195, 205, 207, 209, OR 237, or a 
variant of SEQ ID NO: 1 95, 205, 207, 209, or 237, having one or more mutations: at 
positions 163-165 AAA, AAG, GGT, GGC, GGA, GGG, CAA, or CAG; at positions 
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178-180 GAA or GAG; at positions 331-333 TCT, TCC, TCA, TCG, AGT, or AGC; at 
positions 568-570 CAT, CAC, TCT, TCC, TCA, TCG, AGT, AGC, ACT, ACC, ACA, 
TCA, TAT, TAC, ATG or ACG; at positions 571-573 TTA, TTG, CTT, CTC, CTA, 
CTG, GTT, GTC, GTA, GTG, ATG, ACT, ACC, ACA, GAT, GAC, GGT, GGC, 
5 GGA, GGG, GAA, GAG, TAT, TAC, or ACG; at positions 595-597 GAA, GAG, TTA, 

TTG, CTT, CTC, CTA, or CTG; at positions 664-666 TTA, TTG, CTT, CTC, CTA, or 
CTG; or any combination thereof, or a fragment thereof encoding a polypeptide having 
nitrilase activity, that selectively produces an (X^-enantiomer, so as to make (R)~ethy\ 4- 
cyano-3-hydroxybutyric acid. 

10 

33. A method for making an (SJ-ethyl 4-cyano-3-hydroxybutyric acid, the method 
comprising contacting a hydroxyglutaryl nitrile with at least one polypeptide having an 
amino acid sequence of any one of SEQ ID NO:196, 206, 208, 210 or 238, or a variant 
of SEQ ID NO:196, 206, 208, 210 or 238, having one or more mutations: at residue 55 

15 lysine, glycine, or glutamine; at residue 60 glutamic acid; at residue 111 serine, at 

residue 190, serine, histidine, tyrosine or threonine; at residue 191, leucine, valine, 
methionine, aspartic acid, glycine, glutamic acid, tyrosine or threonine; at residue 199 = 
glutamic acid or leucine; at residue 222 leucine; or any combination thereof, or a 
fragment or peptidomimetic thereof having nitrilase activity that selectively produces an- 

20 f$-enantiomer, so as to make (3^-ethyl 4-cyano-3-hydroxybutyric acid. 

34. A method for making an (7y-mandelic acid, the method comprising admixing a 
mandelonitrile with at least one polypeptide having an amino acid sequence of any one 
of SEQ ID NO:196, 206, 208, 210 or 238, or a variant of SEQ ID NO:196, 206, 208, 

25 210 or 238, having one or more mutations: at residue 55 lysine, glycine, or glutamine; 

at residue 60 glutamic acid; at residue 111 serine, at residue 190, serine, histidine, 
tyrosine or threonine; at residue 191, leucine, valine, methionine, aspartic acid, glycine, 
glutamic acid, tyrosine or threonine; at residue 199 glutamic acid or leucine; at residue 
222 leucine; or any combination thereof, or any fragment or peptidomimetic thereof 

30 having nitrilase activity. 

35. A method for making an ^-mandelic acid, the method comprising admixing a 
mandelonitrile with at least one polypeptide having an amino acid sequence of SEQ ID 
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NO:196, 206, 208, 210 or 238, or a variant of SEQ ID NO:196, 206, 208, 210 or 238, 
having one or more mutations: at residue 55 lysine, glycine, or glutamine; at residue 60 
glutamic acid; at residue 111 serine, at residue 190, serine, histidine, tyrosine or 
threonine; at residue 191, leucine, valine, methionine, aspartic acid, glycine, glutamic 
5 acid, tyrosine or threonine; at residue 199 glutamic acid or leucine; at residue 222 

leucine; or any combination thereof, or any fragment or peptidomimetic thereof having 
nitrilase activity. 

36. A method for making an (SJ-phenyl lactic acid derivative or an (R>phenyl lactic acid 
10 derivative, the method comprising admixing a phenyllactocyanonitrile with at least one . 

polypeptide selected from SEQ ID NO:196, 206, 208, 210 or 238, or a variant of SEQ 
ID NO:196, 206, 208, 210 or 238, having one or more mutations: at residue 55 lysine, 
glycine, or glutamine; at residue 60 glutamic acid; at residue 111 serine, at residue 190, 
serine, histidine, tyrosine or threonine; at residue 191, leucine, valine, methionine, 

1 5 aspartic acid, glycine, glutamic acid, tyrosine or threonine; at residue 199 glutamic acid 

or leucine; at residue 222 leucine; or any combination thereof, or any fragment or 
peptidomimetic thereof having nitrilase activity, or any active fragment or 
peptidomimetic thereof that selectively produces an (S>enantiomer or an (R)- 
enantiomer, thereby producing an (S>phenyl lactic acid derivative or an (T^-phenyl 

20 lactic acid derivative. 

37. A method for making the polypeptide of claim 16 or claim 17 or fragments thereof, the 
method comprising 

(a) introducing a nucleic acid encoding the polypeptide into a host cell under conditions 
25 that permit production of the polypeptide by the host cell, and 

(b) recovering the polypeptide so produced. 

38. A method for generating a nucleic acid variant encoding a polypeptide having nitrilase 
activity, wherein the variant has an altered biological activity from that which naturally 

30 occurs, the method comprising 

(a) modifying the nucleic acid of any one of claims 1 to 6, 12, or 13 by 

(i) substituting one or more nucleotides for a different nucleotide, wherein 
the nucleotide comprises a natural or non-natural nucleotide; 
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(ii) deleting one or more nucleotides, 

(iii) adding one or more nucleotides, or 

(iv) any combination thereof 



39. A method for making a polynucleotide from two or more nucleic acids, the method 
comprising: 

(a) identifying regions of identity and regions of diversity between two or more nucleic 
acids, wherein at least one of the nucleic acids comprises a nucleic acid of any one of 
claims 1 to 6, 12, or 13; 

(b) providing a set of oligonucleotides which correspond in sequence to at least two of the 
two or more nucleic acids; and, 

(c) extending the oligonucleotides with a polymerase, thereby making the polynucleotide. 

40. A screening assay for identifying a nitrilase, the assay comprising: 

(a) providing a plurality of nucleic acids or polypeptides comprising at least one of the 
nucleic acids of any one of claims 1 to 6, 12, or 13, or at least one of the polypeptides of 
claim 16 or claim 17 or fragments thereof; 

(b) obtaining polypeptide candidates to be tested for nitrilase activity from the plurality; . 

(c) testing the candidates for nitrilase activity, and 

(d) identifying those polypeptide candidates which are nitrilases. 

41. A kit comprising (a) the nucleic acid of any one of claims 1 to 6, 12, or 13, or a 
fragment thereof encoding a polypeptide having nitrilase activity, or (b) the polypeptide 
of any one of claim 16 or claim 17 or fragments thereof, or a peptidomimetic thereof 
having nitrilase activity, or a combination thereof; and (c) a buffer. 

42. A method for modifying a molecule comprising: 

(a) mixing a polypeptide of any one of claim 16 or claim 17 or fragments thereof, or 
peptidomimetic thereof having nitrilase activity, with a starting molecule to produce a 
reaction mixture; 

(b) reacting the starting molecule with the polypeptide to produce the modified 
molecule. 
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43. A method for identifying a modified compound comprising: 

(a) admixing a polypeptide of any one of claim 16 or claim 17 or fragments thereof, 
or peptidomitnetic thereof having nitrilase activity, with a starting compound to produce a 
reaction mixture and thereafter a library of modified starting compounds; 
5 (b) testing the library to determine whether a modified starting compound is present 

within the library which exhibits a desired activity; 

(c) identifying the modified compound exhibiting the desired activity. 

44. A computer readable medium having stored thereon at least one nucleotide sequence 

1 0 selected from the group consisting of: SEQ ID NO: 1 , 3, 5, 7, 9, 1 1 , 1 3, 1 5, 1 7, 1 9, 2 1 , 

23, 25, 27, 29, 31, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 
71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 111, 
113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 
147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 

15 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 209, 21 1, 213, 

215, 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 247, 
249, 251, 253, 255, 257, 259, 261, 263, 265, 267, 269, 271, 273, 275, 277, 279, 281, 
283, 285, 287, 289, 291, 293, 295, 297, 299, 301, 303, 305, 307, 309, 311, 313, 315, 
317, 319, 321, 323, 325, 327, 329, 331, 333, 335, 337, 339, 341, 343, 345, 347, 349, 

20 351, 353, 355, 357, 359, 361, 363, 365, 367, 369, 371, 373, 375, 377, 379, 381, 383 

385, or variants thereof, and/or at least one amino acid sequence selected from the 
group consisting of: SEQ ID NO:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 
34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 
80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 

25 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 

154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 
188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 
222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 
256, 258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 

30 290, 292, 294, 296, 298, 300, 302, 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 

324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 352, 354, 356, 
358, 360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 380, 382, 384 and 386, and 
variants thereof. 
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45. A computer system comprising a processor and a data storage device, wherein the data 
storage device has stored thereon at least one nucleotide sequence selected from the 
group consisting of SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 

5 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 

81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 
121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 
155, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 
189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 209, 211, 213, 215, 217, 219, 221, 

10 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 247, 249, 25 1, 253, 255, 

257, 259, 261, 263, 265, 267, 269, 271, 273, 275, 277, 279, 281, 283, 285, 287, 289, 
291, 293, 295, 297, 299, 301, 303, 305, 307, 309, 311, 313, 315, 317, 319, 321, 323, 
325, 327, 329, 331, 333, 335, 337, 339, 341, 343, 345, 347, 349, 351, 353, 355, 357, 
359, 361, 363, 365, 367, 369, 371, 373, 375, 377, 379, 381, 383, 385, and variants 

15 thereof and/or at least one amino acid sequence selected from the group consisting of: 

SEQ ID NO:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 
44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 
90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 
128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 

20 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 

196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 
230, 232, 234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 
264, 266, 268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 
298, 300, 302, 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 

25 332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 352, 354, 356, 358, 360, 362, 364, 

366, 368, 370, 372, 374, 376, 378, 380, 382, 384, 386, and variants thereof. 

46. A method for identifying a feature in a sequence which comprises: 

(a) inputting the sequence into a computer; 
30 (b) running a sequence feature identification program on the computer so as to 

identify a feature within the sequence; and 
(c) identifying the feature in the sequence, 
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wherein the sequence comprises SEQ ID NOS: 1-386, variants, or any combination 
thereof. 



47. An assay for identifying a functional fragment of a polypeptide which comprises: 
5 (a) obtaining a fragment of at least one polypeptide of claim 16 or claim 17; 

(b) contacting at least one fragment from step (a) with a substrate having a 
cyanohydrin moiety or an aminonitrile moiety under reaction conditions suitable for 
nitrilase activity; 

(c) measuring the amount of reaction product produced by each at least one fragment 
10 from step (b); and 

(d) identifying the at least one fragment which is capable of producing a nitrilase 
reaction product; thereby identifying a functional fragment of the polypeptide. 

48. An assay for identifying a functional variant of a polypeptide which comprises: 

15 (a) obtaining at least one variant of at least one polypeptide of claim 16 or claim 17; 

(b) contacting at least one variant from step (a) with a substrate having a cyanohydrin 
moiety or an aminonitrile moiety under reaction conditions suitable for nitrilase activity, 

(c) measuring the amount of reaction product produced by each at least one variant 
from step (b); and 

20 (d) identifying the at least one variant which is capable of producing a nitrilase 

reaction product; thereby identifying a functional variant of the polypeptide. 

49. An assay for screening enantioselective transformation comprising: 

(a) labeling one of two prochiral or enantiotopic moieties in a molecule; 

(b) modifying at least one of the two moieties by selective catalyst to produce 
25 products; and 

(c) determining the resultant products by mass spectroscopy. 

50. The assay of claim 49, wherein the label is heavier or lighter isotope. 

30 51. The assay of claim 49, wherein the selective catalyst is an enzyme. 

52. The assay of claim 49, wherein the use of mass spectroscopy is by a positive mode or 
a negative mode. 
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53. The assay of claim 49, wherein the analysis is of either a parental mass or a 
fragmentation mass. 

54. The assay of claim 49, wherein the assay can be used to monitor or determine % 
enantiomeric excess or % diasteromeric excess. 

55. An isolated or recombinant polypeptide having a nitrilase activity comprising a 
sequence as set forth in SEQ ID NO:196, 206, 208, 210 or 238 and having one or 
more mutations selected from the group consisting of a mutation at residue 55 lysine, 
residue 55 glycine, residue 55 glutamine, residue 60 glutamic acid, residue 1 1 1 serine, 
residue 190, residue 190 serine, residue 190 histidine, residue 190 tyrosine, residue 
190 threonine, residue 191 leucine, residue 191 valine, residue 191 methionine, 
residue 191 aspartic acid, residue 191 glycine, residue 191 glutamic acid, residue 191 
tyrosine, residue 191 threonine, residue 199 glutamic acid, residue 199 leucine, 
residue 222 leucine, and any combination thereof. 

56. An isolated or recombinant polypeptide having a nitrilase activity comprising a 
sequence as set forth in SEQ ID NO:196, 206, 208, 210 or 238 and having a mutation 
at residue 190 or equivalent, wherein alanine is replaced with a hydrogen-binding 
amino acid or peptidomimetic residue. 

57. An isolated or recombinant polypeptide having a nitrilase activity comprising a 
sequence as set forth in SEQ ID NO:196, 206, 208, 21 0 or 238 and having a mutation 
at residue 190 or equivalent, wherein alanine is replaced with a hydrophobic amino 
acid or peptidomimetic residue. 

58. An isolated or recombinant nitrilase having the equivalent of one or more mutations at 
residue 55 lysine, glycine, or glutamine; at residue 60 glutamic acid; at residue 1 1 1 
serine, at residue 190, serine, histidine, tyrosine or threonine; at residue 191, leucine, 
valine, methionine, aspartic acid, glycine, glutamic acid, tyrosine or threonine; at 
residue 199 glutamic acid or leucine; at residue 222 leucine of SEQ ID NO: 196, 206, 
208, 210 or 238. 
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59. An amplification primer pair for amplifying a nucleic acid encoding a polypeptide 
having a nitrilase activity, wherein the primer pair is capable of amplifying a nucleic 
acid comprising a sequence as set forth in claim 1, or a subsequence thereof. 

60. The amplification primer pair of claim 59, wherein a member of the amplification 
primer sequence pair comprises an oligonucleotide comprising at least about 10 to 50 
consecutive bases of the sequence, or, about 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 
23, 24, 25, 26, 27, 28, 29, 30 or more consecutive bases of the sequence. 

61. A nitrilase-encoding nucleic acid generated by amplification of a polynucleotide using 
an amplification primer pair as set forth in claim 59. 

62. The nitrilase-encoding nucleic acid of claim 61, wherein the amplification is by 
polymerase chain reaction (PCR). 

63. The nitrilase-encoding nucleic acid of claim 62, wherein the nucleic acid generated by 
amplification of a gene library. 

64. The nitrilase-encoding nucleic acid of claim 63, wherein the gene library is an 
environmental library. 

65. An isolated or recombinant nitrilase encoded by a nitrilase-encoding nucleic acid as set 
forth in claim 61. 
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Figurt 3 
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Figure 4 
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Figure 5 
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Figure 6A 
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4-fluoro phenyl glycinonitrile (R)-4-fluoro phenyl glycine (S)-4-fluoro phenyl glycine 
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Figure 16 




c 



CN 




COOH 



OH 



OH 




COOH 



OH 



3-phenyl propanal cyanohydrin (R)-2-hydroxy-4-phenyl butyric acid (S)-2-hydroxy-4-phenyl butyric 

acid 



2-amino hexanenitrile 



NH 2 NH 2 

"COOH "^^^-^COOH 
(R)-2-amino-hexanolc acid (S)-2-amino-hexanoic acid 



OH 

mandelonitrile 




OH 

(R)-mandeiic acid 



OH 



(S)-mandeiJc add 



OH 



OH QH 

Q^cooh QiP C00H 



2-pyrldyl formaldehyde cyanohydrin (R)-pyridyl glycollc acid (S>-pyridyl glycolic acid 
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Figure 16 



2-amino-6-hydroxy hexanenitrtle 



NH 2 

(R)-2-amino-6-hydroxy hexanoic acid ($)-2-amino-6-hydroxy hexanoic 

acid 



NH 2 



COOH 



XY 

NH 2 



COOH 



2-aminc-4,4-dimethyl pentanenitrile (R)-2-amirto-4 f 4-dimethyl pentanotc acid (S)-2-amino-4,4-dimethy1 pentanoic 

add 



HOOC JM 




(S)-cyctohexyt mandelic acid 




(R)-cydohexyl mandelic acid 



OCH 3 



H3CO' 



OCH3 NH 2 
HaCC^ — ^-^COOH 



OCH3 ^H 2 
HaCO^^^^^COOH 



2-amino-6,6-dimethoxy hexanenitrile (R)-2-amino-8,6-dimethoxy hexanoic acid(S)-2-amino-6 ( 6-dimethoxy hexanoic 

add 



crc 



OH 

phenyl acetaJdehyde cyanohydrin 



(R>-phenyl tactic acid 



one 



^COOH 
OH 

(S)-phenyl lactic add 



OH 

hydroxyglutarytnitrile 



OH QH 
NC^l^COOH NC^x^COOH 
(RM3>-hydroxy-4-cyano-butyric add (SH3)-hydroxy-4<yano-butyric add 
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2-bromo mandelonilrile 
CH 3 OH 




2-methyl mandelonitrile 
OH 



CI 



3-chloro mandelonitrile 
QH 

BfN 




CN 



3-bromo mandelonitrile 



Figure 16 

Br OH 



Br OH 




COOH 



(R)-2-bromo mandelic acid (S)-2-bromo mandelic acid 

CH 3 OH 

"COOH 



CH 3 OH 
^COOH 

(R)-2 -methyl mandelic acid (S)-2-methyl mandelic acid 




OH 

c,s O^ C00H 



QH 

clv O^ C00H 

(RV-3-chloro mandelic acid (S)-3-chloro mandelic acid 
OH QH 

^V !S? V x ^ s cooh 



(R)-3-bromo mandelic acid (S)-3-broiTK> mandelic acid 
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OH 



Figure 16 




OH 



CN 



4-fluoro mandelonitrile 
QH 




PCTAJS03/15712 



OH 



COOH 




f jOT cooh 

(R)-4-fluoro mandelic acid (S)-4-fluoro mandelic acid 
OH 




OH 



COOH ^^y^COOH 



2-napthyt formaldehyde cyanohydrln (R)-2-napthylglycolic acid (S)-2-napthyiglycolic acid 

QH OH QH 

"COOH r^V" ^COOH 






1-napthyl formaldehyde cyanohydrin 
OH 



N 



(R)-1-napthytglycollc acid (S>-1 -napthylglycolic acid 
QH QH 



COOH 



3-pyridyl formaldehyde cyanohydrin 
OH 



Q-cooh 

N 

(R)-3-pyridyl glycolic acid (S)-3-pyridyl glycolic acid 



Is. J} 
N 



0^ 



OH OH 
COOH ^y^COOH 



3-thienyl formaldehyde cyanohydrin (R)-3-thienyl glycolic acid (S)-3-thienyl glycolic acid 
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crx 



CN 
OH 



phenyl acetaldehyde cyanohydrin 




(R)-phenyl lactic acid 



COOH 



cnr 

(S)-phenyl lactic acid 




COOH 



0H 0H KjJ OH 

2-methyl phenyl acetaldehyde cyanohydrin (R)-2-methyl phenyl lactic acid (S)-2-methyl phenyl lactic acid 

^ Br B r 

' C00H /L^yCOOH 

0H KsT OH 

2-bromo phenyl acetaldehyde cyanohydrin (R)-2-bromo phenyl lactic acid (S)-2-bromo-phenyl lactic acid 

■ ? P 

' CN \^COOH J^^COOH 

^ 0H U OH (J } H 

2-fluoro phenyl acetaldehyde cyanohydrin (R)-2-fluoro-phenyt lactic acid (S>-2-fluoro-phenyl lactic acid 



ire 

henyl acetaldet 

6rt 




CN 
OH 



; -v^W^COOH F^^^, 

Tj 6h ijX« 



COOH 



0H OH 
3-fluoro phenyl acetaldehyde cyanohydrin (R)-3-fluoro-phenyi lactic acid (S}-3-fluoro-phenyl lactic acid 
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Figure 16 



\sj OH 



-methyl phenyl acetaldehyde cyanohydrin 

.CN 

OH 




1-napthyl acetaldehyde cyanohydrin 



S— ' OH 
3-thiophene acetaldehyde cyanohydrin 



OH 

2-thiophene acetaldehyde cyanohydrin 
OH 

2-pyridine acetaldehyde cyanohydrin 
„CN 



One 




COOH h 3 C. 




COOH 



OH li^J OH 

(R)-3-methyl-phenyi lactic acid (S)-3-methyl-phenyl lactic acid 

.COOH 

OH 





COOH 



(RH-napthyl lactic acid 
"V^COOH 

6h 



(S)-1-napthyl lactic acid 
✓COOH 



S— ' OH 

(R)-3-thiophene lactic acid (S)-3-thiophene lactic acid 



COOH 



OH 



OH 



COOH 



(R)-2-thiophene lactic acid (S)-2-thiophene lactic acid 
f*Y^S C0OH ^W^COOH 

6h l^A oh 

(R)-2-pyridlne lactic acid (S)-2-pyridine lactic acid 




COOH 



N 



OH 



COOH 



3-pyridine acetaldehyde cyanohydrin (R)-3-pyridine lactic acid 



N ° H 
(S)-3-pyridine lactic acid 
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Figure 17 



OH 

proR selective Y n 

NC^A^COOH 



OH 

(RH-cyano-3-hydroxy butyric acid 

Hydroxyglutaryl nitrile (HGN) 

proS selectiw^ qh 
nitnlase 



(S)-4-cyano-3-hydroxy butyric acid 
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SEQUENCE LISTING 



<110> DIVERSA CORPORATION 



<120> Nitrilases, Nucleic Acids Encoding Them and Methods for Making and 
Using Them 



<130> Docket No. 09010-120WO4 



<140> Unknown 
<141> 2003-05-15 



<150> US 10/241,742 
<151> 2002-09-09 

<150> US 10/146,772 
<151> 2002-05-15 



<150> US 60/309,006 
<151> 2001-07-30 



<150> US 60/351,336 
<151> 2002-01-22 

<150> US 60/300,189 
<151> 2001-06-21 



<150> US 09/751,299 
<151> 2000-12-28 



<150> US 60/254,414 
<151> 2000-12-07 

<150> US 60/173,609 
<151> 1999-12-29 



<160> 386 



<170> FastSEQ for Windows Version 4.0 



<210> 1 
<211> 939 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 1 

atggaaaagt 

gtgaatatag 

attgctttgc 

attcagacgc 

gtattgatag 

atgtgttatg 

gaaaatgctt 

aatacgcgcg 

cttcgcaatc 

ttccccggtt 



atattaaagt 
aaaattgcga 
cggagttctt 
ctgacggcaa 
gaggctcatt 
ccaacggcgc 
tttatgaagg 
ttggtgcagc 
aggtggatgt 
ttgtgcaaaa 



cgccgcaatt 
acgtttggca 
caatacgggc 
ggctgcgatg 
tctgtgcagg 
tctcgtgggc 
tggggattcc 
cgtctgttgg 
catcatgggt 
gctgtgggaa 



cagatagcta 
ttatcggcgg 
gttagttgga 
ttcttacgcg 
ttgccggatg 
aaacatgaca 
agcgatattg 
gagttcatgc 
ggttcctgct 
cctggaaata 



caaaaatagc 
tcaatgaggg 
acaaaaaaat 
acttttctgc 
gcagtgtgcg 
aagacctacc 
gggtgctggg 
ggacgatgac 
ggtggagcat 
gccgcaacgc 



agattcaccc 
tgcgcgttgg 
tgccttggct 
aagacatcat 
caaccgctat 
cacgatgtgg 
aacatttgaa 
tgcccggcgt 
accgaccaat 
gcttgctgcc 



60 
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180 
240 
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360 
420 
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540 
600 
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atacaggata atgcgcgtct cattggcgtg ccggttgttc atgccgctca ttgcggtgaa 660 

attgagtgtc cgatgccagg attgccgata ggttacaggg ggttctttga gggtaacgcg 720 

gccattgtga atgcagaagg tcaggtgctt gcgcatcggg gtgctggcga gggcgaagga 780 

attgtttgcg cggagatttt accggtagcc aaatcaaaca ggtcggaaat tcccaatcgt 840 

tactggttgc gctgcagagg ctttctacct atttttgcct ggcatcagca acgttggttg 900 

ggaaggcatt ggtatttgcg caatgtgcgc aggacttaa 939 

<210> 2 
<211> 312 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 2 














Met 


Glu 


Lys 


Tyr 


He 


Lys 


Val 


Ala 


1 








5 








Ala 


Asp 


Ser 


Pro 
20 


Val 


Asn 


He 


Glu 


Ala 


Val 


Asn 


Glu Gly Ala 


Arg 


Trp 






35 










40 


Thr 


Gly 


Val 


Ser 


Trp Asn 


Lys 


Lys 




50 










55 




Asp 


Gly 


Lys 


Ala 


Ala 


Met 


Phe 


Leu 


65 










70 






Val 


Leu 


He 


Gly Gly Ser 


Phe 


Leu 










85 








Arg 


Asn 


Arg 


Tyr Met 


Cys 


Tyr 


Ala 








100 










Asp 


Lys 


Asp 
115 


Leu 


Pro 


Thr 


Met 


Trp 
120 


Asp 


Ser 


Ser 


Asp He Gly 


Val 


Leu 




130 










135 




Gly 


Ala 


Ala 


Val 


Cys 


Trp 


Glu 


Phe 


145 










150 






Leu 


Arg 


Asn 


Gin 


Val 
165 


Asp 


Val 


He 


lie 


Pro 


Thr 


Asn 
180 


Phe 


Pro 


Gly 


Phe 


Asn 


Ser 


Arg 
195 


Asn 


Ala 


Leu 


Ala 


Ala 
200 


Gly 


Val 
210 


Pro 


Val 


Val 


His 


Ala 
215 


Ala 


Met 


Pro 


Gly 


Leu 


Pro 


He 


Gly 


Tyr 


225 










230 






Ala 


He 


Val 


Asn 


Ala 
245 


Glu 


Gly 


Gin 


Glu 


Gly 


Glu 


Gly He Val 


Cys 


Ala 








260 










Asn 


Arg 


Ser 
275 


Glu 


He 


Pro 


Asn 


Arg 
2B0 


Leu 


Pro 
290 


He 


Phe 


Ala 


Trp 


His 
295 


Gin 


Tyr 


Leu 


Arg 


Asn 


Val 


Arg 


Arg 


Thr 


305 










310 







<210> 3 
<211> 981 
<212> DNA 



Ala 


He 


Gin He Ala Thr Lys 


He 




10 


15 




Asn 


Cys 


Glu Arg Leu Ala Leu 


Ser 


25 




30 




He 


Ala 


Leu Pro Glu Phe Phe 


Asn 






45 




He 


Ala 


Leu Ala He Gin Thr 


Pro 






60 




Arg 


Asp 


Phe Ser Ala Arg His 


His 






75 


80 


Cys 


Arg 


Leu Pro Asp Gly Ser 


Val 




90 


95 




Asn 


Gly 


Ala Leu Val Gly Lys 


His 


105 




110 




Glu 


Asn 


Ala Phe Tyr Glu Gly Gly 






125 




Gly 


Thr 


Phe Glu Asn Thr Arg 


Val 






140 




Met 


Arg 


Thr Met Thr Ala Arg Arg 






155 


160 


Met 


Gly 


Gly Ser Cys Trp Trp 


Ser 




170 


175 




Val 


Gin 


Lys Leu Trp Glu Pro Gly 


185 




190 




He 


Gin 


Asp Asn Ala Arg Leu 


He 






205 




His 


Cys 


Gly Glu He Glu Cys 


Pro 






220 




Arg 


Gly 


Phe Phe Glu Gly Asn Ala 






235 


240 


Val 


Leu 


Ala His Arg Gly Ala Gly 




250 


255 




Glu 


He 


Leu Pro Val Ala Lys 


Ser 


265 




270 




Tyr 


Trp 


Leu Arg Cys Arg Gly 


Phe 






285 




Gin 


Arg 


Trp Leu Gly Arg His 


Trp 



300 
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<213> Unknown 
<220> 

<223> Obtained from an environmental sample 



<400> 3 

atggggaatt 

gcgacagtcg 

gtggtctttc 

gggcattcgc 

ccgggcgatg 

atcggcatca 

tacatcggag 

gagcgcctgg 

ggccgattgg 

tcggcatggg 

tccacaatgc 

atgaaaattg 

gagggctggc 

ggcccggcaa 

gggcccaggt 

atcgtgcacc 

gagcctgggc 



cgttcaagat 
ccaaggcatg 
cggaggcgtt 
aaccactgcg 
ccaccgatcg 
atgaagtgaa 
cggacggcgc 
tctgggcgct 
gtggcctgtt 
gaaccgagat 
ggcatatcgc 
acgatgtacc 
tcaaccccgg 
gcgagcagga 
ggcagttgga 
gcgaacctaa 
cggaggcctg 



cgcggtggta 
ccgcctgatc 
tgtgcccgga 
ggagttatac 
gctttgcgcg 
cagcgaagcc 
gattctgggc 
tggcgatggg 
gtgctgggag 
ctacgtggct 
gaaagaaggg 
cgaccggctg 
cgatagcgta 
gacgattctg 
tgtggccggc 
gcgatttttg 
a 



caagcctgtc 
gcggaggcag 
tacccactgt 
gccgaactgg 
gcagccagag 
agcggcacga 
aaacaccgca 
agcgacctgg 
aactacatgc 
ccaacttggg 
cgatgctacg 
gcgttcaaag 
atcgtcgatc 
tatgccgatg 
cactacgcgc 
acgatagctc 



cggtctttct 
ccgcggcggg 
gggtctggtt 
tggggaacgc 
aagccggcgt 
cgatttacaa 
aagtaatgcc 
aggtctacga 
ccctggcccg 
atcgcggaga 
tagtgggatg 
ggaagtatct 
cggacggcaa 
ccgaccggtc 
ggccggatat 
cgcggacgaa 



ggatcgtggc 
cgccagcctg 
cattccggca 
cgtggcggta 
ggtagtggcg 
tacgctgctg 
gacgggcgga 
cctgcctttc 
gtacgcgatg 
accgtggctg 
ctgcagttgc 
gtcgacggcc 
gctgatcgcg 
taagatcacc 
cttcgaactg 
ggaggagcgg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
981 



<210> 4 
<211> 326 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 4 


























Met 


Gly 


Asn 


Ser 


Phe 


Lys 


He 


Ala 


Val 


Val 


Gin 


Ala Cys Pro 


Val 


Phe 


1 








5 










10 






15 




Leu 


Asp 


Arg 


Gly 
20 


Ala 


Thr 


Val 


Ala 


Lys 
25 


Ala 


Cys 


Arg Leu He 
30 


Ala 


Glu 


Ala 


Ala 


Ala 
35 


Ala 


Gly 


Ala 


Ser 


Leu 
40 


Val 


Val 


Phe 


Pro Glu Ala 
45 


Phe 


Val 


Pro 


Gly 
50 


Tyr 


Pro 


Leu 


Trp 


val 
55 


Trp 


Phe 


He 


Pro 


Ala Gly His 
60 


Ser 


Gin 


Pro 


Leu 


Arg 


Glu 


Leu 


Tyr 


Ala 


Glu 


Leu 


Val 


Gly 


Asn Ala Val 


Ala 


Val 


65' 










70 










75 






80 


Pro 


Gly 


Asp 


Ala 


Thr 


Asp 


Arg 


Leu 


Cys 


Ala 


Ala 


Ala Arg Glu Ala Gly 










85 










90 






95 




Val 


Val 


Val 


Ala 
100 


lie 


Gly 


lie 


Asn 


Glu 
105 


Val 


Asn 


Ser Glu Ala 
110 


Ser 


Gly 


Thr 


Thr 


lie 


Tyr 


Asn 


Thr 


Leu 


Leu 


Tyr 


He 


Gly 


Ala Asp Gly Ala 


He 






115 










120 








125 






Leu 


Gly 
130 


Lys 


His 


Arg 


Lys 


Val 
135 


Met 


Pro 


Thr 


Gly 


Gly Glu Arg 
140 


Leu 


Val 


Trp 


Ala 


Leu 


Gly 


Asp 


Gly 


Ser 


Asp 


Leu 


Glu 


Val 


Tyr Asp Leu 


Pro 


Phe 


145 










150 










155 






160 


Gly 


Arg 


Leu 


Gly 


Gly 
165 


Leu 


Leu 


Cys 


Trp 


Glu 
170 


Asn 


Tyr Met Pro 


Leu 
175 


Ala 


Arg 


Tyr 


Ala 


Met 
180 


Ser 


Ala 


Trp 


Gly 


Thr 
185 


Glu 


He 


Tyr Val Ala 
190 


Pro 


Thr 


Trp 


Asp 


Arg 
195 


Gly 


Glu 


Pro 


Trp 


Leu 
200 


Ser 


Thr 


Met 


Arg His He 
205 


Ala 


Lys 
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Glu Gly 


Arg 


Cys Tyr 


Val 


Val 


Gly 


Cys 


Cys 


Ser 


Cys 


Met 


Lys He Asp 


210 








215 










220 






Asp Val 


Pro 


Asp Arg 


Leu 


Ala 


Phe 


Lys 


Gly 


Lys 


Tyr 


Leu 


Ser Thr Ala 


225 






230 










235 






240 


Glu Gly 


Trp 


Leu Asn 
245 


Pro 


Gly 


Asp 


Ser 


Val 
250 


He 


Val 


Asp 


Pro Asp Gly 
255 


Lys Leu 


He 


Ala Gly 
260 


Pro 


Ala 


Ser 


Glu 
265 


Gin 


Glu 


Thr 


He 


Leu Tyr Ala 
270 


Asp Ala 


Asp 
275 


Arg Ser 


Lys 


He 


Thr 
280 


Gly 


Pro 


Arg 


Trp 


Gin 
285 


Leu Asp Val 


Ala Gly 


His 


Tyr Ala 


Arg 


Pro 


Asp 


He 


Phe 


Glu 


Leu 


He 


Val His Arg 


290 








295 










300 






Glu Pro 


Lys 


Arg Phe 


Leu 


Thr 


He 


Ala 


Pro 


Arg Thr 


Lys 


Glu Glu Arg 


305 






310 










315 






320 


Glu Pro 


Gly 


Pro Glu 
325 


Ala 



















<210> 5 
<211> 1005 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 5 

atgggtacca agcacccggc 
gatctcgacc gcaccgtcga 
gcgaagctca ttgcgttccc 
ggcacgccgg cgtgggcgat 
gcctacgaca gcccgcaggc 
gtggtgctcg gcctgtcgga 
gggccggacg gcgagaccat 
accgtgttcg gcgatggcga 
cgcctcggcg cgctgtgctg 
gcccagaacg agcaggtgca 
gcccacgcgc tcggctggga 
tcgtgtttcg tgctcggcgc 
gacaccgagg acaagcgggc 
ccggacggca gatcgctggc 
gacatcgacc tcggcgcaat 
tcgcgcccgg acgtgacgcg 
ttctcgctgc cggtcgacgc 

<210> 6 
<211> 334 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 6 

Met Gly Thr Lys His Pro Ala Phe Lys Ala Ala Val 

15 10 
Pro Glu Trp Leu Asp Leu Asp Arg Thr Val Asp Lys 

20 25 
He Glu Glu Ala Ala Gly Ala Gly Ala Lys Leu He 
35 40 



cttcaaggcc gcagtggtcc aggccgcgcc ggaatggctc 60 

caagaccatc gcgctgatcg aggaggccgc cggcgccggc 120 

ggaaacctgg attcccggct atccgtggca catctgggtc 180 

cagccgcggc ttcgtgcagc gctacttcga caattcactg 24 0 

ccagcgcatc gcggacgccg cgaagaagaa caagatcacc 300 

gcgcgagggt ggcagccttt atatctcgca gtggctgatt 360 

tgccaagcgc cgcaaactgc gccccaccca cgtcgagcgc 420 

cggcagccac atcgcggtgc acgagcgtgc tgacatcggc 480 

ctgggagcac atccagccgc tgaccaaata cgccatgtat 540 

cgtcgccgcc tggccgagct tctcgatgta cgagccgttc 600 

agtcaacaat gcggcgagca agatctacgc cgtcgaaggc 660 

atgcgcggtg atctcgcagg cgatggtcga cgaaatgtgc 720 

gctggtccat gccggcggcg gccacgcggt gatcttcggg 780 

ggacaagatt ccggagaccc aggaaggcct gctctatgcc 840 

tggcgtggcc aagaacgcgg ccgatccggc ggggcactac 900 

gctcctgttc aacaacaagc cggcgcgccg ggtcgagtat 960 

ggtcgagacg ccgccgcagc cctga 1005 



Val Gin Ala Ala 
15 

Thr He Ala Leu 
30 

Ala Phe Pro Glu 
45 
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Thr 


Trp 


lie 


Pro Gly 


Tyr 


Pro 


Trp 


HIS 


lie 


Trp 


Val 


Gly 


Thr 


Pro 


Ala 












55 








60 








Trp 


Ala 


lie 


Ser Arg 


Gly 


Pne 


Val 


Gin 


Arg 


Tyr 


Phe 


Asp 


Asn 


Ser 


Leu 


65 








70 










75 










80 


Ala 


Tyr 


Asp 


Ser Pro 


Gin 


Ala 


Gin 


Arg 


lie 


Ala 


Asp 


Ala 


Ala 


Lys 


Lys 








85 










90 










95 


Asn 


Lys 


lie 


Thr Val 
100 


Val 


Leu 


Gly 


Leu 
105 


Ser 


Glu 


Arg 


Glu 


Gly 
110 


Gly 


Ser 


Leu 


Tyr 


lie 
115 


Ser Gin 


Trp 


Leu 


lie 
120 


Gly 


Pro 


Asp 


Gly 


Glu 
125 


Thr 


He 


Ala 


Lys 


Arg 


Arg 


Lys Leu 


Arg 


Pro 


Thr 


His 


Val 


Glu 


Arg 


Thr 


Val 


Phe 


Gly 




130 








135 










140 








Asp 


Gly 


Asp 


Gly Ser 


His 


lie 


Ala 


Val 


His 


Glu 


Arg 


Ala 


Asp 


He 


Gly 


145 








150 










155 










160 


Arg 


Leu 


Gly 


Ala Leu 


Cys 


Cys 


Trp 


Glu 


His 


lie 


Gin 


Pro 


Leu 


Thr 


Lys 








165 










170 










175 


Tyr 


Ala 


Met 


Tyr Ala 


Gin 


Asn 


Glu 


Gin 


Val 


His 


Val 


Ala 


Ala 


Trp 


Pro 








180 








185 










190 




Ser 


Phe 


Ser 


Met Tyr 


Glu 


Pro 


Phe 


Ala 


His 


Ala 


Leu 


Gly 


Trp 


Glu 


Val 






195 








200 










205 






Asn 


Asn 
210 


Ala 


Ala Ser 


Lys 


lie 
215 


Tyr 


Ala 


Val 


Glu 


Gly 
220 


Ser 


Cys 


Phe 


Val 


Leu 


Gly 


Ala 


Cys Ala 


Val 


lie 


Ser 


Gin 


Ala 


Met 


Val 


Asp 


Glu 


Met 


Cys 


225 








230 










235 










240 


Asp 


Thr 


Glu 


Asp Lys 
245 


Arg 


Ala 


Leu 


Val 


His 
250 


Ala 


Gly 


Gly 


Gly 


His 
255 


Ala 


Val 


lie 


Phe 


Gly Pro 
260 


Asp 


Gly 


Arg 


Ser 
265 


Leu 


Ala 


Asp 


Lys 


He 
270 


Pro 


Glu 


Thr 


Gin 


Glu 
275 


Gly Leu 


Leu 


Tyr 


Ala 
280 


Asp 


lie 


Asp 


Leu 


Gly 
285 


Ala 


He 


Gly 


Val 


Ala 
290 


Lys 


Asn Ala 


Ala 


Asp 
295 


Pro 


Ala 


Gly 


His 


Tyr 
300 


Ser 


Arg 


Pro 


Asp 


Val 


Thr 


Arg 


Leu Leu 


Phe 


Asn 


Asn 


Lys 


Pro 


Ala 


Arg 


Arg 


Val 


Glu 


Tyr 


305 








310 










315 










320 


Phe 


Ser 


Leu 


Pro Val 
325 


Asp 


Ala 


Val 


Glu 


Thr 
330 


Pro 


Pro 


Gin 


Pro 







<210> 7 
<211> 999 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 7 

atgccgagtg attatcatgc tccattcaaa gtagcagttg tccaggcaac tcccgtcttt 60 

ctcgatcgca gcgcgacgat tgagaaggca tgtgagctaa ttgcctgtgc tggacgtgag 120 

ggcgcacgtc tgatcgtgtt tcctgaagcg ttcattccca cctatcccga ttgggtctgg 180 

accattccac ctggggagat gcggctgctt ggcgaactct acacagagtt gcttgccaat 240 

gcggtcacga tccccagtaa tgcaacggat aggctctgcc aggctgcgaa acgagctgct 300 

gcgtatgtgg tcatgggaat gaacgaacgc aatatcgagg cgagtggaag gagtctctat 360 

aacaccctgt tatacatcga tgctcagggc cagatcatgg gcaaacaccg caagttgata 420 

cccacagccg gtgagcggct catatgggcg caaggagatg ggagtacatt ccaggtctac 480 

gatactcctc tgggcaaact gggagggctc atctgctggg aaaactacat gcctctggct 54 0 

cgctatgcga tgtatgcctg gggcacgcag atttatgtcg ccccgacatg ggatcgtggc 600 

aacctctggc tctctactct gcggcatatc gctaaggagg gaggcgtcta tgttcttggt 660 

tgtagtatgg tcatgcgcaa gaatgacatt cccgatcact ttgctttcaa agagcagttt 720 

tatgctactg tggacgaatg gatcaacgtt ggtgacagcg ccattgtcca tcccgagggg 780 

aactttcttg cgggaccggt gcgccacaaa gaagagattc tctatgcaga acttgatcca 84 0 
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cgccaatcgt gcggtccggg atggatgctc gatgtggctg ggcactatgc acgccctgat 900 
gtgtttgaat tgattgtcca cacagagatg cgacccatga tgaagcaaga agaggtagga 960 
ggagaaaata catctgaggg aggtgtacga ttcttgtaa 999 

<210> 8 
<211> 332 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 8 



Met Pro 


Ser 


Asp 


Tyr 


His 


Ala 


Pro 


Phe Lys 


Val Ala Val Val Gin Ala 


1 






5 








10 


15 


Thr Pro 


Val 


Phe 


Leu 


Asp 


Arg 


Ser 


Ala Thr 


He Glu Lys Ala Cys Glu 






20 










25 


30 


Leu lie 


Ala 


Cys 


Ala 


Gly 


Arg 


Glu 


Gly Ala 


Arg Leu lie Val Phe Pro 




35 










40 




45 


Glu Ala 


Phe 


He 


Pro 


Thr 


Tyr 


Pro 


Asp Trp 


Val Trp Thr He Pro Pro 


50 










55 






60 


Gly Glu 


Met 


Arg 


Leu 


Leu 


Gly 


Glu 


Leu Tyr 


Thr Glu Leu Leu Ala Asn 


65 








70 








75 80 


Ala Val 


Thr 


He 


Pro 


Ser 


Asn 


Ala 


Thr Asp 


Arg Leu Cys Gin Ala Ala 








85 








90 


95 


Lys Arg 


Ala 


Ala 


Ala 


Tyr 


Val 


Val 


Met Gly 


Met Asn Glu Arg Asn He 






100 










105 


110 


Glu Ala 


Ser 


Gly Arg 


Ser 


Leu 


Tyr 


Asn Thr 


Leu Leu Tyr He Asp Ala 




115 










120 




125 


Gin Gly 


Gin 


He 


Met 


Gly 


Lys 


His 


Arg Lys 


Leu He Pro Thr Ala Gly 


130 










135 






140 


Glu Arg 


Leu 


He 


Trp 


Ala 


Gin 


Gly 


Asp Gly 


Ser Thr Phe Gin Val Tyr 


145 








150 








155 160 


Asp Thr 


Pro 


Leu Gly 


Lys 


Leu 


Gly 


Gly Leu 


He Cys Trp Glu Asn Tyr 








165 








170 


175 


Met Pro 


Leu 


Ala 


Arg 


Tyr 


Ala 


Met 


Tyr Ala 


Trp Gly Thr Gin He Tyr 






180 










185 


190 


Val Ala 


Pro 


Thr 


Trp 


Asp 


Arg 


Gly 


Asn Leu 


Trp Leu Ser Thr Leu Arg 




195 










200 




205 


His He 


Ala 


Lys 


Glu 


Gly 


Gly 


Val 


Tyr Val 


Leu Gly Cys Ser Met Val 


210 










215 






220 


Met Arg 


Lys 


Asn Asp 


He 


Pro 


Asp 


His Phe 


Ala Phe Lys Glu Gin Phe 


225 








230 








235 240 


Tyr Ala 


Thr 


Val 


Asp 


Glu 


Trp 


He 


Ash Val 


Gly Asp Ser Ala He Val 








245 








250 


255 


His Pro 


Glu 


Gly Asn 


Phe 


Leu 


Ala 


Gly Pro 


Val Arg His Lys Glu Glu 






260 










265 


270 


He Leu 


Tyr 


Ala 


Glu 


Leu 


Asp 


Pro 


Arg Gin 


Ser Cys Gly Pro Gly Trp 




275 










280 




285 


Met Leu 


Asp 


Val 


Ala 


Gly 


His 


Tyr 


Ala Arg 


Pro Asp Val Phe Glu Leu 


290 










295 






300 


He Val 


His 


Thr 


Glu 


Met 


Arg 


Pro 


Met Met 


Lys Gin Glu Glu Val Gly 


305 








310 








315 320 


Gly Glu 


Asn 


Thr 


Ser 


Glu 


Gly 


Gly 


Val Arg 


Phe Leu 



325 330 



<210> 9 
<211> 945 
<212> DNA 
<213> Unknown 
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<220> 

<223> Obtained from an environmental sample 



<400> 9 

atggcggctc acaagatcgc ggtggttcag gcgcccagcg ttctcctcga tcgcgagggc 60 

tcggtcgcgc gcgcggtcac gctgctcgac gaggcggcgg cggccggcgc ccgcctggtc 120 

gtgtttccgg aggcctacat ccccggctac ccggactgga tctggcgcct gcgcccctac 180 

ccggacgtca agctggccgc cgagctgcac gaacggctgc tcgccaacgc ggtggatctc 240 

tccaccgacg tgctggcgcc ggtgctggcg gcggcggcgc gtcacgggct caccgtggtc 300 

atgtgcgtgc aggagcgcga cgccggattc agccgcgcca cactttacaa caccgcgctg 360 

gtcatcgacg ccgccggcaa gatcgcgaac cggcaccgca agctcatgcc caccaacccc 420 

gagcgaatgg tgtggggatt cggtgacgcc tcggggctgc gggtggtgag cacgcccgtc 480 

gggcgggtgg gcacgctcct gtgctgggag agctacatgc ccctggcgcg ctgcgcgctc 540 

tacgccgagg gggtcgagat ctacgtgacc ccgacctggg actacggcga aggctggcgc 600 

gccagcatgc agcacatcgc ccgcgagggg cgctgctggg tggtgaccgc ttgcatgtgc 660 

gtgcaggcgc gcgacgtgcc ggccgacttc cccgggcgcg cccagctcta ccccgacgag 720 

gaggagtggt tgaaccccgg cgattcgctg gtggtcgacc ccggcggcaa gatcgtggcc 780 

ggtccgatgt cgcgcgagaa ggggatcttg tacgcggaga tcgatccgga tcgcgtggcg 840 

ggggcgcacc gctcgttcga cgtcgtgggc cactactcgc gtcccgacgt gttccggctg 900 

gaggtcgatc ggacaccggc ggcgccggtg agcttcaaaa aatga 945 

<210> 10 
<211> 314 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 10 



Met 


Ala 


Ala 


His 


Lys 


He 


Ala 


Val 


Val 


Gin 


Ala 


Pro 


Ser 


Val Leu Leu 


1 








5 










10 








15 


Asp 


Arg 


Glu 


Gly 


Ser 


Val 


Ala 


Arg 


Ala 


Val 


Thr 


Leu 


Leu Asp Glu Ala 








20 










25 










30 


Ala 


Ala 


Ala 
35 


Gly 


Ala 


Arg 


Leu 


Val 
40 


Val 


Phe 


Pro 


Glu 


Ala 
45 


Tyr He Pro 


Gly 


Tyr 


Pro 


Asp 


Trp 


He 


Trp 


Arg 


Leu 


Arg 


Pro 


Tyr 


Pro Asp Val Lys 




50 










55 










60 






Leu 


Ala 


Ala 


Glu 


Leu 


His 


Glu 


Arg 


Leu 


Leu 


Ala 


Asn 


Ala 


Val Asp Leu 


65 










70 










75 






80 


Ser 


Thr 


Asp 


Val 


Leu 


Ala 


Pro 


Val 


Leu 


Ala 


Ala 


Ala 


Ala Arg His Gly 










85 










90 








95 


Leu 


Thr 


Val 


Val 


Met 


Cys 


Val 


Gin 


Glu 


Arg 


Asp Ala 


Gly Phe Ser Arg 








100 










105 










110 


Ala 


Thr 


Leu 


Tyr 


Asn 


Thr 


Ala 


Leu 


Val 


He 


Asp Ala 


Ala 


Gly Lys He 






115 










120 










125 




Ala 


Asn 
130 


Arg 


His 


Arg 


Lys 


Leu 
135 


Met 


Pro 


Thr 


Asn 


Pro 
140 


Glu 


Arg Met Val 


Trp 


Gly 


Phe 


Gly 


Asp 


Ala 


Ser 


Gly 


Leu 


Arg 


Val 


Val 


Ser 


Thr Pro Val 


145 










150 










155 






160 


Gly 


Arg 


Val 


Gly 


Thr 
165 


Leu 


Leu 


Cys 


Trp 


Glu 
170 


Ser 


Tyr 


Met 


Pro Leu Ala 
175 


Arg 


Cys 


Ala 


Leu 


Tyr 


Ala 


Glu 


Gly 


Val 


Glu 


He 


Tyr 


val 


Thr Pro Thr 






180 










185 










190 


Trp 


Asp 


Tyr 
195 


Gly 


Glu 


Gly 


Trp 


Arg 
200 


Ala 


Ser 


Met 


Gin 


His 
205 


He Ala Arg 


Glu 


Gly 


Arg 


Cys 


Trp 


Val 


Val 


Thr 


Ala 


Cys 


Met 


Cys 


Val 


Gin Ala Arg 



210 215 220 
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Asp Val Pro Ala Asp Phe Pro Gly Arg Ala Gin Leu Tyr Pro Asp Glu 

225 230 235 , 

Glu Glu Trp Leu Asri Pro Gly Asp Ser Leu Val Val Asp Pro Gly Gly 

245 250 255 

Lvs lie Val Ala Gly Pro Met Ser Arg Glu Lys Gly lie Leu Tyr Ala 

260 2 65 270 

Glu lie Asp Pro Asp Arg Val Ala Gly Ala His Arg Ser Phe Asp Val 

275 280 285 

Val Glv His Tyr Ser Arg Pro Asp Val Phe Arg Leu Glu Val Asp Arg 

29 $ 295 300 

Thr Pro Ala Ala Pro Val Ser Phe Lys Lys 
305 310 

<210> 11 
<211> 966 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



atgactggat cttatcctaa agacacactg atcgttgggc tagctcaaat cgctcctgtc 
tggctggatc gggcggggac actgtcaaag atactggctc aagtccatgc ggcaaatcaa 
gcgggttgtc atctcgtagc atttggcgaa gggctgcttc ctggatatcc gttttggatt 
gagcgaacaa atggcgcgct gttcaactcg actgtacaaa aggaaatcca cgcgcattat 
atggatcagg cggtgcagat cgaagccggt catctcgatc cgctttgtgc aacagccaag 
aaatttggaa tcaccgttgt actcggatgc atcgaacgcc cactcgatcg gggcggtcac 
agcttgtatg caagtctggt atatattgat tccgagggca gcattcaatc cgtgcatcgc 
aaactaatgc caacctacga agaacgactt acctggtcgt caggcgatgg gcacggttta 
cgagtgcata ccttaggtgc gtttacggtg ggtggtctca actgttggga aaattggatg 
cccttggcgc gcgcagcgat gtatggtcag ggtgaagatt tacatgttgc gatctggcca 
ggcggttctc atctcacgca ggatattacc cgctttattg cgctcgaatc acgttcgtac 
gtattatctg tctccggtct gatgcgcgca accgattttc caaaagatac tccccatctt 
gcctccatcc tagctaaagg tgaagagatt cttgcgaatg gtggttcttg tattgcaggt 
cctgacggca agtgggtcgt tgggcctctt gtaggagaag agaagttaat tgtcgcaacc 
attgatcact gccgcgtgcg cgaagaacgt cagaatttcg atccttccgg ^attacagc 
cggcccgatg tactgcaatt aaaaatcaac agggaacgcc agagcacaat ttcatttagc 
gag tag 

<210> 12 
<211> 321 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
966 



<400> 12 

Met Thr Gly Ser Tyr Pro Lys Asp 

1 5 
lie Ala Pro Val Trp Leu Asp Arg 
20 

Ala Gin Val His Ala Ala Asn Gin 

35 40 
Glv Glu Gly Leu Leu Pro Gly Tyr 

50 55 
Gly Ala Leu Phe Asn Ser Thr Val 
65 70 



Thr Leu lie Val Gly Leu Ala Gin 

10 15 
Ala Gly Thr Leu Ser Lys He Leu 
25 30 
Ala Gly Cys His Leu Val Ala Phe 
45 

Pro Phe Trp He Glu Arg Thr Asn 
60 

Gin Lys Glu He His Ala His Tyr 
75 80 
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Met Asp Gin Ala Val Gin lie Glu Ala Gly His Leu Asp Pro Leu Cys 

85 90 95 

Ala Thr Ala Lys Lys Phe Gly He Thr Val Val Leu Gly Cys He Glu 

100 105 110 

Arg Pro Leu Asp Arg Gly Gly His Ser Leu Tyr Ala Ser Leu Val Tyr 

115 120 125 

He Asp Ser Glu Gly Ser He Gin Ser Val His Arg Lys Leu Met Pro 

130 135 140 

Thr Tyr Glu Glu Arg Leu Thr Trp Ser Ser Gly Asp Gly His Gly Leu 
145 150 155 160 

Arg Val His Thr Leu Gly Ala Phe Thr Val Gly Gly Leu Asn Cys Trp 

165 170 175 

Glu Asn Trp Met Pro Leu Ala Arg Ala Ala Met Tyr Gly Gin Gly Glu 

180 185 190 

Asp Leu His Val Ala He Trp Pro Gly Gly Ser His Leu Thr Gin Asp 

195 200 205 

He Thr Arg Phe He Ala Leu Glu Ser Arg Ser Tyr Val Leu Ser Val 

210 215 220 

Ser Gly Leu Met Arg Ala Thr Asp Phe Pro Lys Asp Thr Pro His Leu 
225 230 235 240 

Ala Ser He Leu Ala Lys Gly Glu Glu He Leu Ala Asn Gly Gly Ser 

245 250 255 

Cys He Ala Gly Pro Asp Gly Lys Trp Val Val Gly Pro Leu Val Gly 

260 265 270 

Glu Glu Lys Leu lie Val Ala Thr He Asp His Cys Arg Val Arg Glu 

275 280 285 

Glu Arg Gin Asn Phe Asp Pro Ser Gly His Tyr Ser Arg Pro Asp Val 

290 295 300 

Leu Gin Leu Lys He Asn Arg Glu Arg Gin Ser Thr He Ser Phe Ser 
305 310 315 320 

Glu 



60 



<210> 13 
<211> 1014 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 13 

atgggcattc aacatccgaa atataaggtt gcggtggtgc aggcggcgcc ggcctggctc 

gatctcgatg cgtcgatcgc caaatcgatc gcgttgatcg aggaggcggc tgccaatggc 120 

gccaagctga tcgccttccc ggaggcgttc atccctggct atccctggta tatctggctg 180 

gactcgccgg cctgggcgat cggccgcggt tttgtgcagc gctatttcga caactcgctg 240 

gcctatgaca gcccgcaggc cgagaagctg cggctggcgg tgaagaaggc cggcctcacc 300 

gccgtgatcg gcctctccga - gcgcgagggc ggcagccttt atctcgcgca atggctgatc 360 

gggcccgatg gcgagaccat cgcaaagcgc cgcaagctgc ggccgaccca tgccgagcgc 420 

accgtctatg gcgaaggcga tggcagcgat ctcgcggtgc atgaccgccc cggcatcggc 480 
cggctcggcg cgctgtgctg ctgggagcat ctgcagccgc tgtcgaaata cgcgatgtat * 540 

gcccagaacg agcaggttca tgtcgcggcc tggccgagct tctcgctcta cgacccgttc 600 

gcgccggcgc tcggctggga ggtcaacaat gcggcctcac gcgtctatgc ggtggaaggc 660 

tcgtgcttcg tgctggcgcc ctgcgcgacg gtgtcgaagg cgatgatcga cgagctctgc 720 

gaccgcgacg acaagcacgg gctgctgcat gtcggcgggg gacacgccgc gatctatggg 7 80 

ccggacggct cttcgattgc ggagaaattg ccgccggagc aggagggcct gctctatgcc 840 

gacatcgatc tcggcgccat cgggattgcc aagaacgccg ccgatccggc cggacattac 900 

tcgcggcccg acgtgacgcg gctgttgctc aacaagaagc cgtcgaagcg tgtcgagcat 960 

ttttcgctgc cggtcgacaa tgtcgagccg gagatcgacg ccgccgccag ctga 1014 

<210> 14 
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<211> 337 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 14 

Met Gly He Gin His Pro Lys Tyr Lys Val Ala Val Val Gin Ala Ala 

15 10 15 

Pro Ala Trp Leu Asp Leu Asp Ala Ser He Ala Lys Ser He Ala Leu 

20 25 30 

He Glu Glu Ala Ala Ala Asn Gly Ala Lys Leu He Ala Phe Pro Glu 

35 40 45 

Ala Phe He Pro Gly Tyr Pro Trp Tyr He Trp Leu Asp Ser Pro Ala 

50 55 60 

Trp Ala He Gly Arg Gly Phe Val Gin Arg Tyr Phe Asp Asn Ser Leu 
65 70 75 80 

Ala Tyr Asp Ser Pro Gin Ala Glu Lys Leu Arg Leu Ala Val Lys Lys 

85 90 95 

Ala Gly Leu Thr Ala Val He Gly Leu Ser Glu Arg Glu Gly Gly Ser 

100 105 110 

Leu Tyr Leu Ala Gin Trp Leu He Gly Pro Asp Gly Glu Thr He Ala 

115 120 125 

Lys Arg Arg Lys Leu Arg Pro Thr His Ala Glu Arg Thr Val Tyr Gly 

130 135 140 

Glu Gly Asp Gly Ser Asp Leu Ala Val His Asp Arg Pro Gly He Gly 
145 150 . 155 160 

Arg Leu Gly Ala Leu Cys Cys Trp Glu His Leu Gin Pro Leu Ser Lys 

165 170 175 

Tyr Ala Met Tyr Ala Gin Asn Glu Gin Val His Val Ala Ala Trp Pro 

180 185 190 

Ser Phe Ser Leu Tyr Asp Pro Phe Ala Pro Ala Leu Gly Trp Glu Val 

195 200 205 

Asn Asn Ala Ala Ser Arg Val Tyr Ala Val Glu Gly Ser Cys Phe Val 

210 215 220. 

Leu Ala Pro Cys Ala Thr Val Ser Lys Ala Met He Asp Glu Leu Cys 
225 230 235 240 

Asp Arg Asp Asp Lys His Gly Leu Leu His Val Gly Gly Gly His Ala 

245 250 255 

Ala He Tyr Gly Pro Asp Gly Ser Ser He Ala Glu Lys Leu Pro Pro 

260 265 270 

Glu Gin Glu Gly Leu Leu Tyr Ala Asp He Asp Leu Gly Ala He Gly 

275 280 285 

He Ala Lys Asn Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp 

290 295 300 

Val Thr Arg Leu Leu Leu Asn Lys Lys Pro Ser Lys Arg Val Glu His 
305 310 315 320 

Phe Ser Leu Pro Val Asp Asn Val Glu Pro Glu He Asp Ala Ala Ala 
325 330 335 

Ser 



<210> 15 
<211> 1047 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
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<400> 15 

atgccaacat caaaacaatt tagagtcgct gcagttcaag ccgccccggt atttcttgac 60 

ctggagggcg caataagcaa aggcatctcc ctcattgagg aggccgcttc caatggagcc 120 

aagctcattg ccttcccgga aacgtggatt cccggctacc cctggtggat ctggctggac 180 

tcacccgctt ggggcatgcg ctttgtccag cgctattttg acaactcgct catgctgggt 240 

agtgagcaag ccaagcgcat gaaccaggct gccgccaata acaagattta cgtggtgatg 300 

ggttatagcg aacgcagtgg cggcagcctc tacatgggcc aatccattat caacgacaag 360 

ggtgaaacga tttttacccg ccgcaaactc aagccaactc atgtcgagcg taccgtgttt 420 

ggggagggag acggcagcca tctttgcgta atggataccg agattggccg cgtcggcgcg 480 

atgtgctgtt gggaacattt gcagccgctc agcaaatatg caatgtattc tcaggatgaa 540 

caaattcaca ttgcctcctg gccgagcttt tcgttatatc ggggggcagc ctatgcactc 600 

ggccccgaac tgaacaacgc cgccagccaa atgtatgcag ccgaaggcca gtgctttgtc 660 

cttgcccctt gcgccaccgt ctcaaaggag atgatcgaaa tgctgataga tgatcccagg 720 

aaagagccgc ttctgctgga aggtggcggg ttcaccatga tttacggccc cgatgggcga 780 

cctctggcta aaccgttgcc tgaaaacgag gaagggctgc tatatgccga tattgacctg 840 

ggcatgattt caatggccaa ggctgccgcc gacccggcag gtcactacgc acgcccggat 900 

gtcactcgcc tactattcaa ttccgcgccc gccaatcgcg tcgagtatat caacccagcg 960 

tcaggcccaa ccgaatcctt aaaagatatg ggaaagatgc aaatggaggc cgaacagcaa 1020 
aaggcggccc tgcgagagat gatctaa 

<210> 16 
<211> 348 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 16 

Met Pro Thr Ser Lys Gin Phe Arg Val Ala Ala Val Gin Ala Ala Pro 

1 5 10 15 

Val Phe Leu Asp Leu Glu Gly Ala He Ser Lys Gly He Ser Leu lie 

20 25 30 

Glu Glu Ala Ala Ser Asn Gly Ala Lys Leu He Ala Phe Pro Glu Thr 

35 40 45 

Trp He Pro Gly Tyr Pro Trp Trp He Trp Leu Asp Ser Pro Ala Trp 

50 55 60 

Glv Met Arg Phe Val Gin Arg Tyr Phe Asp Asn Ser Leu Met Leu Gly 
'65 70 75 80 

Ser Glu Gin Ala Lys Arg Met Asn Gin Ala Ala Ala Asn Asn Lys He 

85 90 95 

Tvr Val Val Met Gly Tyr Ser Glu Arg Ser Gly Gly Ser Leu Tyr Met 

100 105 HO 

Glv Gin Ser lie He Asn Asp Lys Gly Glu Thr lie Phe Thr Arg Arg 

115 120 125 

Lys Leu Lys Pro Thr His Val Glu Arg Thr Val Phe Gly Glu Gly Asp 

130 135 140 

Gly Ser His Leu Cys Val Met Asp Thr Glu He Gly Arg Val Gly Ala 
145 150 155 160 

Met Cvs Cys Trp Glu His Leu Gin Pro Leu Ser Lys Tyr Ala Met Tyr 

165 170 175 

Ser Gin Asp Glu Gin He His lie Ala Ser Trp Pro Ser Phe Ser Leu 

180 185 190 

Tvr Arg Gly Ala Ala Tyr Ala Leu Gly Pro Glu Leu Asn Asn Ala Ala 

195 200 205 

Ser Gin Met Tyr Ala Ala Glu Gly Gin Cys Phe Val Leu Ala Pro Cys 

210 215 220 

Ala Thr Val Ser Lys Glu Met lie Glu Met Leu He Asp Asp Pro Arg 
225 230 235 240 

Lvs Glu Pro Leu Leu Leu Glu Gly Gly Gly Phe Thr Met lie Tyr Gly 
245 250 255 
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60 



Pro Asp Gly Arg Pro Leu Ala Lys Pro Leu Pro Glu Asn Glu Glu Gly 

260 265 270 

Leu Leu Tyr Ala Asp lie Asp Leu Gly Met lie Ser Met Ala Lys Ala 

275 280 285 

Ala Ala Asp Pro Ala Gly His Tyr Ala Arg Pro Asp Val Thr Arg Leu 

290 295 300 

Leu Phe Asn Ser Ala Pro Ala Asn Arg Val Glu Tyr lie Asn Pro Ala 
305 310 315 320 

Ser Gly Pro Thr Glu Ser Leu Lys Asp Met Gly Lys Met Gin Met Glu 

325 330 335 

Ala Glu Gin Gin Lys Ala Ala Leu Arg Glu Met lie 
340 345 

<210> 17 
<211> 993 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 17 

atgagagttg ttaaagccgc agctgtccaa ctgagtcccg tcctctatag ccgcgaggga 

acggtcgaga aggtcgtgcg gaagatccat gaacttgccg aagagggagt cgagttcgtc 120 

acctttcctg agaccgtggt gccttattac ccgtactttt cgttcgttca gacgcccttg 180 

cagcaaatct tcggaacaga gtatctgagg ctgctcgacc aggcagtcac cgtgccatcc 240 

gccgccaccg acgcgatcgg cgaggctgcc aggttcgctg gagttgttgt ctcgatcggc 300 

gtcaacgagc gagacggggg aactctgtac aacactcagc ttctcttcga tgccgacgga 360 

agcttaattc agcggcgccg caagatcacg cccacccatt acgagcgcat gatctggggc 420 

cagggtgacg gctcaggtct gcgggccgtt gatagcaagg ccggccgcat tggtcagctg 480 

gcatgctggg agcacaacaa tccactggcg cgctacgcgc tgatagccga cggcgagcag 540 

atccattcgg ccatgtatcc gggctccatg ttcggcgact cgtttgccaa aaagaccgaa 600 

atcaatatcc ggcagcatgc gctggagtct gcgtgcttcg tcgtgaacgc aacggcctgg 660 

ctggacggcg atcaacaggc gcaaatcatg aaggacaccg gctgcagcat cggcccgatc 720 

tccggcggtt gcttcaccac tatcgtggcg ccggacggtt ccctgatcgg cgagcccctc 780 

cgctcgggtg agggcgtggt catcgccgac ctcgacttca cgttaatcga caggcgtaag 840 

caggtgatgg actcgcgagg ccactacagc cggccggagt tgctcagcct cttaatagac 900 
cgcaccccta ccgcgcactt tcacgaacgc gcttcgcacc ccacgacagg agctgagcaa 
ggctccgagg atgtgttcga ggctaacatt taa 

<210> 18 
<211> 330 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 18 

Met Arg Val Val Lys Ala Ala Ala Val Gin Leu Ser Pro Val Leu Tyr 

15 10 15 

Ser Arg Glu Gly Thr Val Glu Lys Val Val Arg Lys He His Glu Leu 

20 25 30 

Ala Glu Glu Gly Val Glu Phe Val Thr Phe Pro Glu Thr Val Val Pro 

35 40 45 

Tyr Tyr Pro Tyr Phe Ser Phe Val Gin Thr Pro Leu Gin Gin He Phe 

50 55 60 

Gly Thr Glu Tyr Leu Arg Leu Leu Asp Gin Ala Val Thr Val Pro Ser 
65 70 75 80 

Ala Ala Thr Asp Ala He Gly Glu Ala Ala Arg Phe Ala Gly Val Val 
85 90 95 



900 
960 
993 
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Val Ser He Gly Val Asn Glu Arg Asp Gly Gly Thr Leu Tyr Asn Thr 

100 105 110 

Gin Leu Leu Phe Asp Ala Asp Gly Ser Leu He Gin Arg Arg Arg Lys 

115 120 125 

He Thr Pro Thr His Tyr Glu Arg Met He Trp Gly Gin Gly Asp Gly 

130 135 140 

Ser Gly Leu Arg Ala Val Asp Ser Lys Ala Gly Arg He Gly Gin Leu 
145 150 155 160 

Ala Cys Trp Glu His Asn Asn Pro Leu Ala Arg Tyr Ala Leu He Ala 

165 170 175 

Asp Gly Glu Gin He His Ser Ala Met Tyr Pro Gly Ser Met Phe Gly 

180 185 190 

Asp Ser Phe Ala Lys Lys Thr Glu He Asn He Arg Gin His Ala Leu 

195 200 205 

Glu Ser Ala Cys Phe Val Val Asn Ala Thr Ala Trp Leu Asp Gly Asp 

210 215 220 

Gin Gin Ala Gin He Met Lys Asp Thr Gly Cys Ser He Gly Pro He 
225 230 235 240 

Ser Gly Gly Cys Phe Thr Thr He Val Ala Pro Asp Gly Ser Leu He 

245 250 255 

Gly Glu Pro Leu Arg Ser Gly Glu Gly Val Val He Ala Asp Leu Asp 

260 265 270 

Phe Thr Leu He Asp Arg Arg Lys Gin Val Met Asp Ser Arg Gly His 

275 280 285 

Tyr Ser Arg Pro Glu Leu Leu Ser Leu Leu He Asp Arg Thr Pro Thr 

290 295 300 

Ala His Phe His Glu Arg Ala Ser His Pro Thr Thr Gly Ala Glu Gin 
305 310 315 320 

Gly Ser Glu Asp Val Phe Glu Ala Asn He 
325 330 

<210> 19 
<211> 1050 
<212> DNA 
<213> Unknown 

<220> ' 

<223> Obtained from an environmental sample 
<400> 19 

atggaaagca acttccttgc cgcagcagtg caagcagaac cggtttactt caatgctttt 

cagacggccg aaaaggccgc gtcattgatt gacgatgccg gtcggcaggg ggctcgctta 120 

gtgacatttc ccgaaacgtg gctgcccggt tacccgtact ggatctggct tggtgccccc 180 

gcctggggaa tgcatcattt catcctaaag taccatcaaa actcgccggt tgcaggagga 240 

ccagaggaac agatcctttg tcaggcggcc cgccgcaacg ggatttttgt cgtcatggga 300 

ctcagcgaga aaatcggggc aagcctctac atggcgcagt ggttcatcag tccagacggc 360 

aaagtggtcg ctcgccgacg caaattgaag cctactcacg tcgaacgttc ggtcttcggg 420 

gaaggggatg gttccgacat tgtcgttctt gatacacccc ttggaaaggt cgggggcctt 480 

tgctgctggg agcacatgca gccactttcg aagtacgcca tgtactcgca aggcgagcag 540 

atccatgctg cttcttggcc gagtgttagc gtctatcgcg ataaaattta cgttctgggg 600 

ccggagctga acggtgccgc caatcagatg tatgcggcag aaggtcagtg tttcgtcctg 660 

gcatcctggg caacggtttc acaagcggct atcgatcttt tttgcgacac gcccgacaag 720 

gccgcgctca tgaaaattgg tggtggtttt tcccagatct atgggccaga cgggtgcccc 780 
ctggcgaagc cgttgccgga ggacgtcgaa ggattggtga ccgctgagat tgacttcaat 
gccatcacgc gcgtgaaagc agcggcggac cccgtagggc actatagccg gcccgatgta 
ttccgcctgt tgttcaatcg tacgcgccaa gaacgcgtgg tttctgtcaa cacgtttgtg 

ccaggtgtca cccagcgaac cgccaagaat gggtcggcgg acgaattggt cggtcacccg 1020 

gagaacgctg tcgcccgggc tgcagagtaa 1050 

<210> 20 
<211> 349 



60 



840 
900 
960 
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<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 20 

Met Glu Ser Asn Phe Leu Ala Ala Ala Val Gin Ala Glu Pro Val Tyr 

1 5 io is 

Phe Asn Ala Phe Gin Thr Ala Glu Lys Ala Ala Ser Leu lie Asp Asp 

20 25 30 

Ala Gly Arg Gin Gly Ala Arg Leu Val Thr Phe Pro Glu Thr Trp Leu 

35 40 45 

Pro Gly Tyr Pro Tyr Trp He Trp Leu Gly Ala Pro Ala Trp Gly Met 

50 55 60 

His His Phe He Leu Lys Tyr His Gin Asn Ser Pro Val Ala Gly Glv 
65 70 75 80 

Pro Glu Glu Gin He Leu Cys Gin Ala Ala Arg Arg Asn Gly He Phe 

85 90 95 

Val Val Met Gly Leu Ser Glu Lys He Gly Ala Ser Leu Tyr. Met Ala 

100 105 no 

Gin Trp Phe He Ser Pro Asp Gly Lys Val Val Ala Arg Arg Arg Lys 

X15 120 125 

Leu Lys Pro Thr His Val Glu Arg Ser Val Phe Gly Glu Gly Asp Glv 

130 135 140 

Ser Asp He Val Val Leu Asp Thr Pro Leu Gly Lys Val Gly Gly Leu 
145 150 155 160 

Cys Cys Trp Glu His Met Gin Pro Leu Ser Lys Tyr Ala Met Tyr Ser 

165 170 175 

Gin Gly Glu Gin He His Ala Ala Ser Trp Pro Ser Val Ser Val Tyr 

180 185 190 

Arg Asp Lys He Tyr Val Leu Gly Pro Glu Leu Asn Gly Ala Ala Asn 

195 200 205 

Gin Met Tyr Ala Ala Glu Gly Gin Cys Phe Val Leu Ala Ser Trp Ala 

210 215 220 

Thr Val Ser Gin Ala Ala He Asp Leu Phe Cys Asp Thr Pro Asp Lys 
225 230 ■ 235 240 

Ala Ala Leu Met Lys lie Gly Gly Gly Phe Ser Gin He Tyr Gly Pro 

245 250 255 

Asp Gly Cys Pro Leu Ala Lys Pro Leu Pro Glu Asp Val Glu Gly Leu 

260 265 270 

Val Thr Ala Glu He Asp Phe Asn Ala He Thr Arg Val Lys Ala Ala 

275 280 285 

Ala Asp Pro Val Gly His Tyr Ser Arg Pro Asp Val Phe Arg Leu Leu 

290 295 300 

Phe Asn Arg Thr Arg Gin Glu Arg Val Val Ser Val Asn Thr Phe Val 
305 310 315 320 

Pro Gly Val Thr Gin Arg Thr Ala Lys Asn Gly Ser Ala Asp Glu Leu 

325 330 335 

Val Gly His Pro Glu Asn Ala Val Ala Arg Ala Ala Glu 
340 345 

<210> 21 
<211> 1065 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 21 
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420 
480 
540 
600 
660 



atggcactag aacatccgaa gtacgtggcg gccgtggttc aggccgcgcc cgaattcctg 60 

aatctagaca gagggatcga aaagacgatc gcattgatcg acgaagcggg acagaaaggg 120 

gcggccctga ttgcatttcc ggaaacctgg ctgccgggct atccgtttca tgtctggctc 180 

ggtcctcccg catgggcgct tggctcagga ttcgtccagc gctatttcga caactcgatg 240 

acgtacgata gtcctcaggc cgctgcactg agggacgctg ccgcgcgcaa cgggatcacg 300 

gtggtattgg gcttgtcgga gcgatgcggc ggcagcctct atatcgcgca atggatcatc 360 

ggcccggatg gcgcgacggt cgccacgcgc cgcaaattgc ggccgactca tatcgagcgc 420 
accgttttcg gcgatggcga cggcagcgat ctggcagtac acgatctcaa catcggccgc 
cttggcgcac tgtgctgctg ggagcacatt cagccgctga ccaagtacgc gatgtatgcg 
cagcacgaac aggtgcacgt cgcggcctgg ccgagcttct ccatgtatga attcgcgccc 
gcgctcggtc acgaggtgaa caacgcagtc agccgcgtct atgccgttga gggatcgtgc 

ttcgtgctcg cgccgtgcgc ggtcatcagc gagcaaatgg tcgacatgtt gtgcgacacg 720 

gcagacaagc gcgcgatgat acgtgccggc ggcgggcacg cagtggcgtt cgggccggac 780 

ggcgaagctc tggtcgagaa actgccggaa aatgaggaag gcttgctgct ggtcgatatc 840 

gatctcggtc gcatctcgct tgcgaaggct gcggccgacc ccgtcggtca ctacgcgcgc 900 

cccgatgtct tgcggctctg gttcgacaag caaccgcggc ggtgcgtcga acatgccggc 960 

gagaacgacg cgtcgcgcag gtcgcacggg tcgtccgggt cacaatcgcc ggcgcaggat 1020 

gggccggcga acgacatggt agaccgtcag gaaaacgtcg attga 1065 

<210> 22 
<211> 354 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 22 

Met Ala Leu Glu His Pro Lys Tyr Val Ala Ala Val Val Gin Ala Ala 

15 10 I 5 

Pro Glu Phe Leu Asn Leu Asp Arg Gly He Glu Lys Thr He Ala Leu 

20 25 30 

He Asp Glu Ala Gly Gin Lys Gly Ala Ala Leu He Ala Phe Pro Glu 

35 40 45 

Thr Trp Leu Pro Gly Tyr Pro Phe His Val Trp Leu Gly Pro Pro Ala 

50 55 60 

Trp Ala Leu Gly Ser Gly Phe Val Gin Arg Tyr Phe Asp Asn Ser Met 
65 70 75 80 

Thr Tyr Asp Ser Pro Gin Ala Ala Ala Leu Arg Asp Ala Ala Ala Arg 

85 90 95 

Asn Gly He Thr Val Val Leu Gly Leu Ser Glu Arg Cys Gly Gly Ser 

100 105 HO 

Leu Tyr He Ala Gin Trp He He Gly Pro Asp Gly Ala Thr Val Ala 

115 120 125 

Thr Arg Arg Lys Leu Arg Pro Thr His He Glu Arg Thr Val Phe Gly 

130 135 140 

Asp Gly Asp Gly Ser Asp Leu Ala Val His Asp Leu Asn lie Gly Arg 
145 150 155 160 

Leu Gly Ala Leu Cys Cys Trp Glu His He Gin Pro Leu Thr Lys Tyr 

165 170 175 

Ala Met Tyr Ala Gin His Glu Gin Val His Val Ala Ala Trp Pro Ser 

180 185 190 

Phe Ser Met Tyr Glu Phe Ala Pro Ala Leu Gly His Glu Val Asn Asn 

195 200 205 

Ala Val Ser Arg Val Tyr Ala Val Glu Gly Ser Cys Phe Val Leu Ala 

210 215 220 

Pro Cys Ala Val He Ser Glu Gin Met Val Asp Met Leu Cys Asp Thr 
225 230 235 240 

Ala Asp Lys Arg Ala Met He Arg Ala Gly Gly Gly His Ala Val Ala 

245 250 255 

Phe Gly Pro Asp Gly Glu Ala Leu Val Glu Lys Leu Pro Glu Asn Glu 
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260 

Glu Gly Leu Leu Leu Val Asp He 
275 280 
Lys Ala Ala Ala Asp Pro Val Gly 

290 295 
Arg Leu Trp Phe Asp Lys Gin Pro 
305 310 
Glu Asn Asp Ala Ser Arg Arg Ser 
325 

Pro Ala Gin Asp Gly Pro Ala Asn 
340 

Val Asp 



265 270 
Asp Leu Gly Arg He Ser Leu Ala 
285 

His Tyr Ala Arg Pro Asp Val Leu 
300 

Arg Arg Cys Val Glu His Ala Gly 
315 320 
His Gly Ser Ser Gly Ser Gin Ser 

330 335 
Asp Met Val Asp Arg Gin Glu Asn 
345 350 



420 
480 
540 
600 
660 
720 
780 
840 
900 



<210> 23 
<211> 1005 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 

<400> 23 cn 

atgaccgtta tcaaagcagc cgccattcaa atcagccccg tgctctacag ccgggcgggg 6Q 

acagtcgaga aagttgttag gaaggttaga gagctcgggg ccaaaggtgt ccgattcgct 120 

acctttcccg aaaccatcat accgtactac ccgtacttct cgttcgttca gtcggcgttc 180 

gacatgaagc ttgggagtga acatcagcgg ctgctcgacg aatcagtcac aattccttcg 240 

tccgagacgg acgcgatcgc ccaggccgcc aaggaagcgg gcatggtggt gtccgtcggg 300 

gtcaatgagc gcgatgggcg atccatctac aacactcaac ttctgttcga cgctgatggc 360 
acgctcattc agcgtaggcg aaagatcacc ccgacctatc acgagcgcat gatttggggt 
caaggcgatg gatccggcct acgcgcggtc gatagcgccg tgggccggat cggccagctt 

gcctgctggg agcactacct tcccctggcg cggtacgccc tcatcgcgga cggagagcaa 540 

atccactcgg caatgtatcc aggctcgttc gctggtccgc tatttgccga gcagatagag 600 
gttagtatcc gccagcacgc gcttgagtca gcctgcttcg tcgtcaacgc gaccggatgg 

cttagcgccg agcagcaagc tcaaatagtg aaggataccg gatgcgtcgt tggaccaatc 720 

tccggtggct gctttacggc gattgttgat ccggagggtc ggatcatggg ggcgccactc 780 

aaggcaggtg agggggaggt catcgcagat ctcgattttg cgcagattga tttccgcaag 840 

cgtgtgatgg atacgcgagg gcactacagc cgccccgaac ttctaagcct cacgatcgac 900 

cgcagtcagc accatcacat gactgagcga ggcgccgatc accgtgtaga ccacgcaaag 960 

ccaacggtca ccgcagagca gtcggccgtc gagccggcgg aatga lous 

<210> 24 
<211> 334 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 

Met°Thr 4 Val lie Lys Ala Ala Ala He Gin He Ser Pro Val Leu Tyr 

1 5 10 15 

Ser Arg Ala Gly Thr Val Glu Lys Val Val Arg Lys Val Arg Glu Leu 

20 25 30 

Gly Ala Lys Gly Val Arg Phe Ala Thr Phe Pro Glu Thr He He Pro 

35 40 45 

Tyr Tyr Pro Tyr Phe Ser Phe Val Gin Ser Ala Phe Asp Met Lys Leu 

50 55 60 

Gly Ser Glu His Gin Arg Leu Leu Asp Glu Ser Val Thr He Pro Ser 

65 ' 70 75 80 

Ser Glu Thr Asp Ala He Ala Gin Ala Ala Lys Glu Ala Gly Met Val 
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85 










90 


95 




Val 


Ser 


Val 


Gly Val Asn 


Glu Arg Asp 


Gly 


Arg Ser He Tyr Asn 


Thr 








100 










105 




110 




Gin 


Leu 


Leu 


Phe Asp Ala Asp Gly Thr 


Leu 


lie Gin Arg Arg Arg 


Lys 






115 










120 






125 




lie 


Thr 


Pro 


Thr 


Tyr 


His 


Glu Arg Met 


He 


Trp Gly Gin Gly Asp 


Gly 




130 










135 








140 




Ser 


Gly 


Leu 


Arg 


Ala 


Val 


Asp 


Ser 


Ala 


Val 


Gly Arg He Gly Gin 


Leu 


145 










150 










155 


160 


Ala 


Cys 


Trp 


Glu 


His 


Tyr 


Leu 


Pro 


Leu 


Ala 


Arg Tyr Ala Leu He 


Ala 










165 










170 


175 




Asp Gly 


Glu 


Gin 


He 


His 


Ser 


Ala 


Met 


Tyr 


Pro Gly Ser Phe Ala 


Gly 








180 










185 




190 




Pro 


Leu 


Phe 


Ala 


Glu 


Gin 


He 


Glu 


Val 


Ser 


lie Arg Gin His Ala 


Leu 






195 










200 






205 




Glu 


Ser 


Ala 


Cys 


Phe 


Val 


Val 


Asn 


Ala 


Thr 


Gly Trp Leu Ser Ala 


Glu 




210 










215 








220 




Gin 


Gin 


Ala 


Gin 


He 


Val 


Lys 


Asp 


Thr 


Gly 


Cys Val Val Gly Pro 


lie 


225 










230 










235 


240 


Ser 


Gly 


Gly 


Cys 


Phe 


Thr 


Ala 


He 


Val 


Asp 


Pro Glu Gly Arg He 


Met 










245 










250 


255 




Gly Ala 


Pro 


Leu Lys Ala Gly Glu Gly 


Glu 


Val He Ala Asp Leu 


Asp 








260 










265 




270 




Phe 


Ala 


Gin 


He Asp Phe Arg Lys Arg 


Val 


Met Asp Thr Arg Gly 


His 






275 










280 






285 




Tyr 


Ser 


Arg 


Pro 


Glu 


Leu 


Leu 


Ser 


Leu 


Thr 


He Asp Arg Ser Gin 


His 


290 










295 








300 




His 


His 


Met 


Thr Glu Arg Gly Ala Asp 


His 


Arg Val Asp His Ala 


Lys 


305 










310 










315 


320 


Pro 


Thr 


Val 


Thr 


Ala 


Glu 


Gin 


Ser 


Ala 


Val 


Glu Pro Ala Glu 





325 330 

<210> 25 
<211> 939 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 25 

gtgtcatcaa ctatcaaagt cgccattatt caggccgctc ctgcttacta cgacctgcag 60 

gcgtcgctgg caaaggccgc cagtctgatc cgcgaggcgg cacgcggcgg cgcgcaattc 120 

gtcgcgttcg gggagacatg gctgccgggc tatccgatgt ggctggattg gtgtcctggc 180 

gcgatcatct gggataaccc cgccaccaaa accgtcttcg cgcgcctcca tgaaaacagc 240 

gtcgccgttc ccggcaggga aacggcattt ctcgccgacc ttgcgatgtc gttaagcatc 300 

gtattatgca tcggcgtcaa tgagaaggtc atgaatgggc cgggacacgg cacgctctac 360 

aacacgctcc tgacgtttga tgcaacgggt gaaatcatca atcatcatcg caagttgatg 420 

ccaacctatg gcgagagatt ggtatggggg ccgggcgacg cagttggcgt gcaagcggtt 480 

gatagtacgg tcgggcgcat cggcgggctg atctgttggg agcactggat gccgctgcca 540 

cgccaactca tgcacaacag cggcgagcag attcacgtct gcgcatggcc gggcgtgcac 600 

gaaatgcacc agatcgcgag ccgtcattat gcattcgagg gccgctgctt tgtgctggcc 660 

gccggattga tcatgcccgc gttcgacctg cccagcgaac tcgaatttcc gcccgaactg 720 

gccgacaagc gcgactatct cctaatgaac ggcggcagcg ccatcatcaa gcccaatggc 780 

aaatatctcg ccgggccggt ttatgacgaa gagactattc tctgcgccga ccttgacctg 840 

actgagaaca tcaaggagca gatgacgctg gacgtgacag ggcattatgc gcgagcggaa 900 

ctgtttgact tgaatgtggt gcggcggcgg aatgcgtag 939 

<210> 26 
<211> 312 
<212> PRT 
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<213> Unknown 
<220> 

<223> Obtained from an environmental sample 
<400> 26 

Val Ser Ser Thr lie Lys Val Ala He He Gin Ala Ala Pro Ala Tyr 

15 10 " 

Tyr Asp Leu Gin Ala Ser Leu Ala Lys Ala Ala Ser Leu He Arg Glu 

20 25 30 

Ala Ala Arg Gly Gly Ala Gin Phe Val Ala Phe Gly Glu Thr Trp Leu 

35 40 45 

Pro Gly Tyr Pro Met Trp Leu Asp Trp Cys Pro Gly Ala He He Trp 

50 55 60 

Asp Asn Pro Ala Thr Lys Thr Val Phe Ala Arg Leu His Glu Asn Ser 
65 70 75 80 

Val Ala Val Pro Gly Arg Glu Thr Ala Phe Leu Ala Asp Leu Ala Met 

85 90 95 

Ser Leu Ser He Val Leu Cys He Gly Val Asn Glu Lys Val Met Asn 

100 105 HO 

Gly Pro Gly His Gly Thr Leu Tyr Asn Thr Leu Leu Thr Phe Asp Ala 

115 120 125 

Thr Gly Glu He .He Asn His His Arg Lys Leu Met Pro Thr Tyr Gly 

130 135 140 

Glu Arg Leu Val Trp Gly Pro Gly Asp Ala Val Gly Val Gin Ala Val 
145 150 155 160 

Asp Ser Thr Val Gly Arg He Gly Gly Leu He Cys Trp Glu His Trp 

165 170 175 

Met Pro Leu Pro Arg Gin Leu Met His Asn Ser Gly Glu Gin He His 

180 185 190 

Val Cys Ala Trp Pro Gly Val His Glu Met His Gin He Ala Ser Arg 

195 200 205 

His Tyr Ala Phe Glu Gly Arg Cys Phe Val Leu Ala Ala Gly Leu He 

210 215 220 

Met Pro Ala Phe Asp Leu Pro Ser Glu Leu Glu Phe Pro Pro Glu Leu 
225 230 235 240 

Ala Asp Lys Arg Asp Tyr Leu Leu Met Asn Gly Gly Ser Ala He He 

245 250 255 

Lys Pro Asn Gly Lys Tyr Leu Ala Gly Pro Val Tyr Asp Glu Glu Thr 

260 265 270 

He Leu Cys Ala Asp Leu Asp Leu Thr Glu Asn He Lys Glu Gin Met 

275 280 285 

Thr Leu Asp Val Thr Gly His Tyr Ala Arg Ala Glu Leu Phe Asp Leu 

290 295 300 

Asn Val Val Arg Arg Arg Asn Ala 
305 310 

<210> 27 
<211> 1056 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 27 

atgccaaccc ccagcgatca tttcaaaatc gccgctgttc aggcctcgcc cgtgtttctg 
gaccgggagg ccactgtgga aaaggcctgc cggttgatcg ccgaagccgc aaagcagggc 
gcccgcctca tcgtctttcc ggaatctttc atcccgacct acccggattg ggtgtgggcc 
gttcccccgg gaagggaaag aatcctgaac cagctgtatt ctgaattcct ggccaatgcc 
gtcgatgttc ccggcgcggc gaccgaacaa cttgcccagg ctgcacgaat ggccggcgcc 



60 
120 
180 
240 
300 
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tatgtgatta tgggcgtcac cgaaagagac acctcggcca gcggggccag cctctacaac 360 

accctgctct acttcagccc cgaaggcatc ctaatgggca aacaccggaa gctggttccc 420 

acggggggcg aacggctggt ctgggcctac ggagacggca gcacgctgga ggtctacgac 480 

actccgctgg gaaagatcgg cgggctgatc tgctgggaga actacatgcc cctggcccgg 540 

tacacgatgt acgcctgggg cacccagatt tacatcgccg ccacctggga ccgcggggaa 600 

ccgtggctct ccaccctgcg gcatatcgcc aaggaaggaa gggtctacgt catcgggtgc 660 

tgcatcgccc tgcgccaggg ggatatcccg gaccggttcg agtacaaggg aaaattttat 720 

tccgggtccc gggagtggat caatgagggc gacagcgcca tcgtgaaccc ggacggggaa 780 

ttcatcgccg ggccggtgcg gatgaaggag gagatcctgt atgccgagat agacccccgg 840 

cagatgcggg gccccaagtg gatgctcgat gtggccggtc attacgcccg gccggatatc 900 

ttcgagctca tcgtccaccg gaatccccac ccgatgatca aaatcgccga agacaggggc 960 

acggggatcg cctcaagttt gattcgcccc cgccctaacc ttcccccatc aagggggagg 1020 

aaatcggcaa gaagcaaacg caagcccaaa aaatga 1056 

<210> 28 

<211> 351 

<212> PRT 

<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 28 

Met Pro Thr Pro Ser Asp His Phe Lys lie Ala Ala Val Gin Ala Ser 

15 10 15 

Pro Val Phe Leu Asp Arg Glu Ala Thr Val Glu Lys Ala Cys Arg Leu 

20 25 30 

lie Ala Glu Ala Ala Lys Gin Gly Ala Arg Leu lie Val Phe Pro Glu 

35 40 45 

Ser Phe He Pro Thr Tyr Pro Asp Trp Val Trp Ala Val Pro Pro Gly 

50 55 60 

Arg Glu Arg He Leu Asn Gin Leu Tyr Ser Glu Phe Leu Ala Asn Ala 
65 70 75 80 

Val Asp Val Pro Gly Ala Ala Thr Glu Gin Leu Ala Gin Ala Ala Arg 

85 90 95 

Met Ala Gly Ala Tyr Val He Met Gly Val Thr Glu Arg Asp Thr Ser 

100 105 110 

Ala Ser Gly Ala Ser Leu Tyr Asn Thr Leu Leu Tyr Phe Ser Pro Glu 

115 120 125 

Gly He Leu Met Gly Lys His Arg Lys Leu Val Pro Thr Gly Gly Glu 

130 135 140 

Arg Leu Val Trp Ala Tyr Gly Asp Gly Ser Thr Leu Glu Val Tyr Asp 
145 150 155 160 

Thr Pro Leu Gly Lys lie Gly Gly Leu He Cys Trp Glu Asn Tyr Met ' 

165 170 175 

Pro Leu Ala Arg Tyr Thr Met Tyr Ala Trp Gly Thr Gin He Tyr He 

180 185 190 

Ala Ala Thr Trp Asp Arg Gly Glu Pro Trp Leu Ser Thr Leu Arg His 

195 200 205 

He Ala Lys Glu Gly Arg Val Tyr Val He Gly Cys Cys He Ala Leu 

210 215 220 

Arg Gin Gly Asp He Pro Asp Arg Phe Glu Tyr Lys Gly Lys Phe Tyr 
225 230 235 240 

Ser Gly Ser Arg Glu Trp He Asn Glu Gly Asp Ser Ala He Val Asn 

245 250 255 

Pro Asp Gly Glu Phe He Ala Gly Pro Val Arg Met Lys Glu Glu He 

260 265 270 

Leu Tyr Ala Glu He Asp Pro Arg Gin Met Arg Gly Pro Lys Trp Met 

275 280 285 

Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp He Phe Glu Leu He 
290 295 300 



Page 19 



WO 03/097810 



PCT/US03/15712 



Val His Arg Asn Pro His Pro Met lie Lys lie Ala Glu Asp Arg Gly 
305 310 315 320 

Thr Gly lie Ala Ser Ser Leu lie Arg Pro Arg Pro Asn Leu Pro Pro 

325 330 335 

Ser Arg Gly Arg Lys Ser Ala Arg Ser Lys Arg Lys Pro Lys Lys 
340 345 350 

<210> 29 
<211> 1017 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 29 

atgggcatcg aacatacgaa atacaaggtg gcggtggtgc aggcggcgcc ggcctggctc 60 

gacctcgagg cctcgatcgg caagtccatc ggcctaatca aggaggccgc ggacaagggc 120 

gccaagctga tcgcctttcc ggaggccttc atccccggtt acccctggta tatctggatg 180 

gactcgccgg cctgggcgat cggccgcggc ttcgtccagc gctatttcga caattcgctc 240 

tcctacgaca gtccccaggc cgagcggctg cgtgatgccg tgcgccaggc caagctcacc 300 

gccgtgatcg gcctgtccga acgcgacggc ggcagccttt acctggcgca atggttgatc 360 

gggcccgacg gcgaaaccat tgccaagcgc cgcaagctgc ggccgaccca tgccgagcgc 420 

accgtctatg gcgaaggcga cggcagcgat ctggccgtac atgcccggcc cgacatcggt 480 

cgcttgggcg cgctgtgctg ctgggagcat cttcagccgt tgtcgaagta cgcaatgtac 540 

gcccagaacg agcaggtcca cgtcgctgcc tggccgagct tctcgctcta cgatcccttc 600 

gccccggcgc tcggcgccga ggtcaacaac gctgcctcgc gcgtctatgc ggtggagggc 660 

tcctgcttcg tgctcgcgcc ttgcgcgacg gtgtcgcagg ccatgatcga cgaactctgc 720 

gatcggcccg ataagcatgc gctgctgcat gccggcggag gctttgccgc gatctacggc 780 

cccgacggca gccagatcgg cgagaagctg gcgccggatc aggagggtct gctgatcgcc 840 

gagattgatc tgggcgccat cggtgttgcc aagaacgcgg cagatcccgc cggtcattat 900 

tcacggccgg atgtgacgcg gttgctgctc aacaagaagc ggtaccagcg cgtcgagcaa 960 

tttgccttgc ccgccgacat ggtcgagccc gcggacatag gcgcggcggc gagctga 1017 

<210> 30 
<211> 338 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 30 



Met 


Gly 


lie 


Glu 


His 


Thr 


Lys 


Tyr 


Lys 


Val 


Ala Val 


Val 


Gin 


Ala 


Ala 


1 








5 










10 








15 




Pro 


Ala 


Trp 


Leu 
20 


Asp 


Leu 


Glu 


Ala 


Ser 
25 


He 


Gly Lys 


Ser 


He 
30 


Gly 


Leu 


lie 


Lys 


Glu 


Ala 


Ala 


Asp 


Lys 


Gly 


Ala 


Lys 


Leu He 


Ala 


Phe 


Pro 


Glu 




35 










40 








45 








Ala 


Phe 
50 


lie 


Pro 


Gly 


Tyr 


Pro 
55 


Trp 


Tyr 


He 


Trp Met 
60 


Asp 


Ser 


Pro 


Ala 


Trp 


Ala 


lie 


Gly 


Arg 


Gly 


Phe 


Val 


Gin 


Arg 


Tyr Phe 


Asp 


Asn 


Ser 


Leu 


65 










70 










75 








80 


Ser 


Tyr 


Asp 


Ser 


Pro 


Gin 


Ala 


Glu 


Arg 


Leu 


Arg Asp 


Ala 


Val 


Arg 


Gin 








85 










90 








95 




Ala 


Lys 


Leu 


Thr 


Ala 


Val 


He 


Gly 


Leu 


Ser 


Glu Arg 


Asp 


Gly 


Gly 


Ser 






100 










105 








110 






Leu 


Tyr 


Leu 


Ala 


Gin 


Trp 


Leu 


He 


Gly 


Pro 


Asp Gly 


Glu 


Thr 


He 


Ala 




115 










120 








125 








Lys 


Arg 


Arg 


Lys 


Leu 


Arg 


Pro 


Thr 


His 


Ala 


Glu Arg 


Thr 


Val 


Tyr 


Gly 



130 135 140 
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Glu Gly Asp Gly Ser Asp Leu Ala Val His Ala Arg Pro Asp He Gly 
145 150 155 160 

Arg Leu Gly Ala Leu Cys Cys Trp Glu His Leu Gin Pro Leu Ser Lys 

165 170 175 

Tyr Ala Met Tyr Ala Gin Asn Glu Gin Val His Val Ala Ala Trp Pro 

180 185 190 

Ser Phe Ser Leu Tyr Asp Pro Phe Ala Pro Ala Leu Gly Ala Glu Val 

195 200 205 

Asn Asn Ala Ala Ser Arg Val Tyr Ala Val Glu Gly Ser Cys Phe Val 

210 215 220 

Leu Ala Pro Cys Ala Thr Val Ser Gin Ala Met He Asp Glu Leu Cys 
225 230 235 240 

Asp Arg Pro Asp Lys His Ala Leu Leu His Ala Gly Gly Gly Phe Ala 

245 250 255 

Ala He Tyr Gly Pro Asp Gly Ser Gin He Gly Glu Lys Leu Ala Pro 

260 265 270 

Asp Gin Glu Gly Leu Leu He Ala Glu He Asp Leu Gly Ala He Gly 

275 280 285 

Val Ala Lys Asn Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp 

290 295 300 

Val Thr Arg Leu Leu Leu Asn Lys Lys Arg Tyr Gin Arg Val Glu Gin 
305 310 315 320 

Phe Ala Leu Pro Ala Asp Met Val Glu Pro Ala Asp He Gly Ala Ala 
325 330 335 

Ala Ser 



<210> 31 
<211> 933 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 31 

atgaccagaa tagccattat tcagcgaccg cccgtgctgc tcgatcgaag cgccaccatt 60 

gcccgggccg tgcaatcggt cgccgaagcg gcagcgcaag gcgcgaccct gattgtcttg 120 

cccgaatcgt acatccctgg ctatccctca tggatctggc ggctcgcgcc tggcaaagac 180 

ggcgcgatcg tgggccagtt gcatgcgcgc ttgctggcca atgcggtcga cctgagcagc 240 

actgacctcg atgcgcttct tgaagcggcc cgtcagcacg gcgtgaccat tgtttgcggc 300 

atgaacgagt gcgaacggcg tcgcggcggc ggcaccttgt acaacacggt ggtcgtgatc 360 

ggaccggacg gcgtcatgct caaccggcat cgcaaattga tgccgaccaa tcccgagcgc 420 

atggtgcatg gctttggcga tgcatccgga ctgaaagcag ttgatacgcc tgccggccgg 4 80 

ctgggcacgc tgatctgctg ggagagctac atgccgctgg cacgctatgc cctgtacgag 540 

caaggcatcg agatctacat cgcaccaact tatgacagtg gtgacggctg gatcagcacc 600 

atgcgccaca ttgcactcga agggcgctgc tgggtgattg gcagcggcac ggtcctgaaa 660 

ggcagtgata ttccggacga tttcccggaa cgggcacgcc tgttccctga tccggatgag 720 

tggatcaacg atggtgattc ggtagttatc gatccgcagg gaaagatcgt tgccggtccg 780 

atgcgtaggg aagcaggcat tctatacgcc gatatcgacg tcgcgcgcgt agcaccatca 840 

cgccgcacgc tggatgtcgc ggggcattac gcgcgtccgg acgtcttcga gcttcgggta 900 

caccaggcac cgggggcacg agtaagtaat tga 933 

<210> 32 
<211> 310 
<212> PRT 
<213> Unknown 

<22Q> 

<223> Obtained from an environmental sample 
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<400> 32 

Met Thr Arg lie Ala He He Gin Arg Pro Pro Val Leu Leu Asp Arg 

1 5 10 15 

Ser Ala Thr He Ala Arg Ala Val Gin Ser Val Ala Glu Ala Ala Ala 

20 25 30 

Gin Gly Ala Thr Leu He Val Leu Pro Glu Ser Tyr He Pro Gly Tyr 

35 40 45 

Pro Ser Trp He Trp Arg Leu Ala Pro Gly Lys Asp Gly Ala He Val 

50 55 60 

Gly Gin Leu His Ala Arg Leu Leu Ala Asn Ala Val Asp Leu Ser Ser 
65 70 75 80 

Thr Asp Leu Asp Ala Leu Leu Glu Ala Ala Arg Gin His Gly Val Thr 

85 90 95 

He Val Cys Gly Met Asn Glu Cys Glu Arg Arg Arg Gly Gly Gly Thr 

100 105 HO 

Leu Tyr Asn Thr Val Val Val He Gly Pro Asp Gly Val Met Leu Asn 

115 120 125 

Arg His Arg Lys Leu Met Pro Thr Asn Pro Glu Arg Met Val His Gly 

130 135 140 

Phe Gly Asp Ala Ser Gly Leu Lys Ala Val Asp Thr Pro Ala Gly Arg 
145 150 155 160 

Leu Gly Thr Leu He Cys Trp Glu Ser Tyr Met Pro Leu Ala Arg Tyr 

165 170 175 

Ala Leu Tyr Glu Gin Gly He Glu lie Tyr He Ala Pro Thr Tyr Asp 

180 185 190 

Ser Gly Asp Gly Trp He Ser Thr Met Arg His He Ala Leu Glu Gly 

195 200 205 

Ara Cvs Trp Val He Gly Ser Gly Thr Val Leu Lys Gly Ser Asp He 

210 215 220 

Pro Asp Asp Phe Pro Glu Arg Ala Arg Leu Phe Pro Asp Pro Asp Glu 
225 230 235 240 

Trp He Asn Asp Gly Asp Ser Val Val He Asp Pro Gin Gly Lys He 

245 250 255 

Val Ala Gly Pro Met Arg Arg Glu Ala Gly He Leu Tyr Ala Asp He 

260 265 270 

Asp Val Ala Arg Val Ala Pro Ser Arg Arg Thr Leu Asp Val Ala Gly 

275 280 285 

His Tyr Ala Arg Pro Asp Val Phe Glu Leu Arg Val His Gin Ala Pro 

290 295 300 

Gly Ala Arg Val Ser Asn 
305 310 

<210> 33 

<211> 1026 

<212> DNA 

<213> Unknown 

<220> 

<223> Obtained from an environmental sample 

atgttaagtc ccgtgacgca gtatcgcgcc gccgcggtgc aggcggcgcc atcttttctc 
gatctcgacc gcaccgtcga gaagacgatc gcgatcatcg agcaggcggc cgagcaggat 
gtgcgcctga tcgcgtttcc ggaaacctgg attcccggct atccgctctg gatctggctc 
ggctcgccgg cctggggcat gcgcttcgtg cagcgctatt tcgagaactc gctggtgcgc 
ggcagcaaac agtggaacgc gatcgcegat gcggcgcggc gccaccgcat S^gtcgtc 
gtcggcttca gcgagcgcgc gggaggcagc ctctacatgg gccaggcgat cttcggcccc 
gaaggcgagc tcatcgcggc gcgccggaag ctcaagccga cacacgccga gcgaacggtg 
?tcggcgagg gcgacggcag ccacttggcc gtttacgaga cgggcgttgg tcgcatcggc 
gccc?ctgct gctgggagca catccagccg ctctcgaaat acgcgatgta tgcggccaac 
gaacaggrgc atgtggcctc gtggccgtgc ttcagccttt atcgcggcat ggcctatgcg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
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900 
960 



ctcgggccgg aggtgaacac cgccgcgagc caggtctacg cggtcgaggg cggctgctac 660 

gtgctggcct cctgtctcgt cgtgacaccc gagatcctga aggtgctgat cgacacgccc 720 

gacaaggagc cgttgctgct cgccggcggg gggttctcga tgatcttcgg ccccgacggc 780 

cgcgcgctcg cccagccgct gccggagacc gaagaggggc tcgtcacggc cgagatcgat 840 
ctcggcgcga tcgcgctcgc caaggccgcg gccgatcccg ccggccatta cgcgcggccc 
gacgtgacgc ggttgttgct gaacccgcgc cccgcggcgc gcgtcgaagc gctgggtccg 

cgcttcgagg tcgtgcagag cgagcaggcc gagccgccca cgcaaccggc cgaagcggcg 1020 

gattga 1026 

<210> 34 
<211> 341 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 34 

Met Leu Ser Pro Val Thr Gin Tyr Arg Ala Ala Ala Val Gin Ala Ala 

1 5 10 15 

Pro Ser Phe Leu Asp Leu Asp Arg Thr Val Glu Lys Thr He Ala He 

20 25 30 

He Glu Gin Ala Ala Glu Gin Asp Val Arg Leu He Ala Phe Pro Glu 

35 40 45 

Thr Trp lie Pro Gly Tyr Pro Leu Trp He Trp Leu Gly Ser Pro Ala 

50 55 60 

Trp Gly Met Arg Phe Val Gin Arg Tyr Phe Glu Asn Ser Leu Val Arg 
65 70 75 80 

Gly Ser Lys Gin Trp Asn Ala He Ala Asp Ala Ala Arg Arg His Arg 

85 90 95 

Met Thr Val Val Val Gly Phe Ser Glu Arg Ala Gly Gly Ser Leu Tyr 

100 105 110 

Met Gly Gin Ala He Phe Gly Pro Glu Gly Glu Leu He Ala Ala Arg 

115 120 125 

Arg Lys Leu Lys Pro Thr His Ala Glu Arg Thr Val Phe Gly Glu Gly 

130 135 140 

Asp Gly Ser His Leu Ala Val Tyr Glu Thr Gly Val Gly Arg He Gly 
145 150 155 160 

Ala Leu Cys Cys Trp Glu His He Gin Pro Leu Ser Lys Tyr Ala Met 

165 170 175 

Tyr Ala Ala Asn Glu Gin Val His Val Ala Ser Trp Pro Cys Phe Ser 

180 185 190 

Leu Tyr Arg Gly Met Ala Tyr Ala Leu Gly Pro Glu Val Asn Thr Ala 

195 200 205 

Ala Ser Gin Val Tyr Ala Val Glu Gly Gly Cys Tyr Val Leu Ala Ser 

210 215 220 

Cys Leu Val Val Thr Pro Glu He Leu Lys Val Leu lie Asp Thr Pro 
225 230 235 240 

Asp Lys Glu Pro Leu Leu Leu Ala Gly Gly Gly Phe Ser Met He Phe 

245 250 255 

Gly Pro Asp Gly Arg Ala Leu Ala Gin Pro Leu Pro Glu Thr Glu Glu 

260 265 270 

Gly Leu Val Thr Ala Glu He Asp Leu Gly Ala He Ala Leu Ala Lys 

275 280 285 

Ala Ala Ala Asp Pro Ala Gly His Tyr Ala Arg Pro Asp Val Thr Arg 

290 295 300 

Leu Leu Leu Asn Pro Arg Pro Ala Ala Arg Val Glu Ala Leu Gly Pro 
305 310 315 320 

Arg Phe Glu Val Val Gin Ser Glu Gin Ala Glu Pro Pro Thr Gin Pro 

325 330 335 

Ala Glu Ala Ala Asp 
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340 

<210> 35 
<211> 942 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 35 

atgacagctc taaaaattgc tgctgttcaa atgtgcgccg aattgggcgc tacagatcga 60 

aacctgagtg cagctggatc attcgtgcgc gacgcatttc gcgaaggtgc ccagtgggta 120 

atcctcccag agttttttac ctcggcaatg gcattcgcac cttcgatggc gcaagcttgg 180 

ttgccactgg aaggaaaggc gctagcgatg atgcgcagcc ttgcgcgtca attcgatggg 240 

gttgttggag gctcatatgt tgccagagag gggaacgact gcgtaaatgc ctttcttctc 300 

gtctttccgg atggaagcta ctaccggcat gacaaagata ttccaacaat gtgggagaac 360 

tgttactaca tcggcggcgt cgacgatggg gtgctggaaa caccaattgg tgcggtggga 420 

gttgcactgt gttgggagtt catccgaaca caaaccgccc gaagactgaa ggatcgcgtt 480 

caattagtgg ttggcggtac ttgctggtgg gattttccga tgcctgtacc tgaacgatat 540 

ctgaggctga ccaggcatat ctccaggaac tttgagcgcg atgctccggc gcggttggcc 600 

agtatgttgg gtgtgcctgt tgtacacgct tcccatgctg gggattttac tgctgtcacc 660 

ccaggcaatg aaacgaagaa ttaccgatcc aactatctgg gagagaccca gatcgtcgat 720 

gccaatggaa atgtgttgaa gcgaatgaca gtggctgatg gtgagggtta cgtcattgct 780 

gacgttcaat tgggggccat atcaaccggt cgaacttcga tccccgacac cttctggacc 840 

tgcaagctaa cgccaggggc acaacaggct tgggatgaac aaaatgcttt tgggtgtggc 900 

tactatgaga acgtcacacg caaacaccta atcggtcgat ga 942 

<210> 36 
<211> 313 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 36 



Met 


Thr 


Ala 


Leu 


Lys 


He 


Ala 


Ala 


Val 


Gin 


Met Cys Ala Glu Leu 


Gly 


1 








5 










10 


15 




Ala 


Thr 


Asp 


Arg 


Asn 


Leu 


Ser 


Ala 


Ala 


Gly 


Ser Phe Val Arg Asp 


Ala 








20 










25 




30 




Phe 


Arg 


Glu 


Gly 


Ala 


Gin 


Trp 


Val 


He 


Leu 


Pro Glu Phe Phe Thr 


Ser 






35 










40 






45 




Ala 


Met 


Ala 


Phe 


Ala 


Pro 


Ser 


Met 


Ala 


Gin 


Ala Trp Leu Pro Leu 


Glu 




50 










55 








60 




Gly 


Lys 


Ala 


Leu 


Ala 


Met 


Met 


Arg 


Ser 


Leu 


Ala Arg Gin Phe Asp 


Gly 


65 










70 










75 


80 


Val 


Val 


Gly 


Gly 


Ser 


Tyr 


Val 


Ala 


Arg 


Glu 


Gly Asn Asp Cys Val 


Asn 










85 










90 


95 




Ala 


Phe 


Leu 


Leu 


Val 


Phe 


Pro 


Asp 


Gly Ser 


Tyr Tyr Arg His Asp 


Lys 








100 










105 




110 




Asp 


lie 


Pro 


Thr 


Met 


Trp 


Glu 


Asn 


Cys 


Tyr 


Tyr He Gly Gly Val 


Asp 






115 










120 






125 




Asp 


Gly 


Val 


Leu 


Glu 


Thr 


Pro 


He 


Gly Ala 


Val Gly Val Ala Leu 


Cys 




130 










135 








140 




Trp 


Glu 


Phe 


He 


Arg 


Thr 


Gin 


Thr 


Ala 


Arg 


Arg Leu Lys Asp Arg 


Val 


145 










150 










155 


160 


Gin 


Leu 


Val 


Val 


Gly 


Gly 


Thr 


Cys 


Trp 


Trp 


Asp Phe Pro Met Pro 


Val 










165 










170 


175 




Pro 


Glu 


Arg 


Tyr 


Leu 


Arg 


Leu 


Thr 


Arg 


His 


He Ser Arg Asn Phe 


Glu 






180 










185 




190 
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Arg 


Asp 


Ala 


Pro Ala Arg 


Leu 


Ala 


Ser 


Met Leu Gly Val Pro Val 


Val 


195 






200 




205 




His 


Ala 


Ser 


His Ala Gly 


Asp 


Phe 


Thr 


Ala Val Thr Pro Gly Asn 


Glu 




210 






215 






220 




Thr 


Lys 


Asn 


Tyr Arg Ser 


Asn 


Tyr 


Leu 


Gly Glu Thr Gin He Val 


Asp 


225 




230 








235 


240 


Ala 


Asn 


Gly 


Asn Val Leu 


Lys 


Arg 


Met 


Thr Val Ala Asp Gly Glu 


Gly 






245 








250 255 




Tyr 


Val 


He 


Ala Asp Val 


Gin 


Leu 


Gly 


Ala He Ser Thr Gly Arg 


Thr 






260 






265 


270 




Ser 


He 


Pro 


Asp Thr Phe 


Trp 


Thr 


Cys 


Lys Leu Thr Pro Gly Ala 


Gin 






275 






280 




285 




Gin 


Ala 


Trp 


Asp Glu Gin 


Asn 


Ala 


Phe 


Gly Cys Gly Tyr Tyr Glu 


Asn 




290 




295 






300 




Val 


Thr 


Arg 


Lys His Leu 


He 


Gly 


Arg 







305 310 



<210> 37 
<211> 993 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 37 

atggattcag atatgacgga tacttttaag gcagccatta ttcagcacgc gcctgttttt 

ttaaatctcg aagagagtct ggacaaagcc ggcagcctta tagaaaaggc tgccgatcaa 120 

ggcgcgaaag tgatcgcctt tcctgaaaca tggctgcccg gttatcccgt atggctcgac 180 

tactctccaa aagcgggtct gtgggactat cagcctgcaa aatctctcta tcgtctgcta 240 

gtcgataatt cagtcacctt acccggcaaa cacctcgatc aactcctctc catagcgcaa 300 

aagaccggcg catatgttgt aatgggggca cacgaacgag tgggtggaac actctataac 360 

acgacgatct atgttgggat tgatgggaag gagtacaaac ttcatagaaa gctggtgccg 420 

acctataccg aaagattgat ctgggggcgg ggagacggca gcacattgag tgtgttgatg 480 

acggattatg gcgttcttgg aggattgatc tgctgggagc actggatgcc tctggcaaga 540 

gccgcaatgc atgccagata tgaaaccctt catgtggcgc aatggccggc tgtaaaagat 600 

atccatcaga tagcaagcag acattatgct tttgaaggcc ggtgtttcgt gctcgcggca 660 

ggctctgttc tgactcgaag agatataata gaaggattca actcactggc tcgcgccgat 720 

agtgatgcat tggaacttct gaaagctatt tcgggagaag atagtgatct tattttgaat 780 

gggggaagcg cgataattgc gccgaatgga gagtatcttg cgggcccggt ctttaatgaa 840 

ccctccatta tttatgctga aattgatcct gcactgataa gtgagggcca tcttacactg 900 

gatacaagcg gacactactc gcgccctgac atttttcgtc tggagataaa cgatcaaccf 960 

caacatgatg taactttcag atcggggcat tag 9 93 

<210> 38 
<211> 330 
<212> PRT 
<213> Unknown 



60 



<220> 

<223> Obtained from an environmental sample 
<400> 38 

Met Asp Ser Asp Met Thr Asp Thr Phe Lys Ala Ala lie He Gin His 

15 10 15 

Ala Pro Val Phe Leu Asn Leu Glu Glu Ser Leu Asp Lys Ala Gly Ser 

20 25 30 

Leu He Glu Lys Ala Ala Asp Gin Gly Ala Lys Val lie Ala Phe Pro 

35 40 45 

Glu Thr Trp Leu Pro Gly Tyr Pro Val Trp Leu Asp Tyr Ser Pro Lys 
50 55 60 
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Ala Gly 


Leu 


Trp Asp Tyr 


Gin Pro 


Ala 


Lys 


Ser Leu Tyr Arg Leu Leu 


65 




70 








75 80 


Val Asp 


Asn 


Ser Val Thr 


Leu Pro 


Gly Lys 


His Leu Asp Gin Leu Leu 






85 






90 


95 


Ser lie 


Ala 


Gin Lys Thr Gly Ala 


Tyr 


Val 


Val Met Gly Ala His Glu 






100 




105 




110 


Arg Val 


Gly 


Gly Thr Leu 


Tyr Asn 


Thr 


Thr 


He Tyr Val Gly He Asp 




115 




120 






125 


Gly Lys 


Glu 


Tyr Lys Leu 


His Arg 


Lys 


Leu 


Val Pro Thr Tyr Thr Glu 


130 






135 






140 


Arg Leu 


He 


Trp Gly Arg 


Gly Asp 


Gly 


Ser 


Thr Leu Ser Val Leu Met 


145 




150 








155 160 


Thr Asp 


Tyr 


Gly Val Leu Gly Gly 


Leu 


He 


Cys Trp Glu His Trp Met 






165 






170 


175 


Pro Leu 


Ala 


Arg Ala Ala 


Met His 


Ala 


Arg 


Tyr Glu Thr Leu His Val 






180 




185 




190 


Ala Gin 


Trp 


Pro Ala Val 


Lys Asp 


He 


His 


Gin He Ala Ser Arg His 




195 




200 






205 


Tyr Ala 


Phe 


Glu Gly Arg 


Cys Phe 


Val 


Leu 


Ala Ala Gly Ser Val Leu 


210 






215 






220 


Thr Arg 


Arg 


Asp He He 


Glu Gly 


Phe 


Asn 


Ser Leu Ala Arg Ala Asp 


225 




230 








235 240 


Ser Asp 


Ala 


Leu Glu Leu 


Leu Lys 


Ala 


He 


Ser Gly Glu Asp Ser Asp 






245 






250 


255 


Leu He 


Leu 


Asn Gly Gly 


Ser Ala 


He 


He 


Ala Pro Asn Gly Glu Tyr 






260 




265 




270 


Leu Ala 


Gly 


Pro Val Phe 


Asn Glu 


Pro 


Ser 


He He Tyr Ala Glu He 




275 




280 






285 


Asp Pro 


Ala 


Leu He Ser 


Glu Gly 


His 


Leu 


Thr Leu Asp Thr Ser Gly 


290 






295 






300 


His Tyr 


Ser 


Arg Pro Asp 


He Phe 


Arg 


Leu 


Glu He Asn Asp Gin Pro 


305 




310 








315 320 


Gin His 


Asp 


Val Thr Phe 


Arg Ser 


Gly His 








325 






330 





<210> 39 
<211> 1008 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 39 

atgaaaaata tcaaaaactc agaaaaaagc agcacagtaa gagtcgctgc ggtacaaatc 60 

agtccggtgt tgtacaaccg cgaagctacc gttcaaaaag tagtcaacaa aatccttgaa 120 

ctaggaaaac aaggggtaca attcgccact tttccggaaa cgatagtgcc ttattatcct 180 

tatttctctt ttattcaggc gccttatgcc atgggcaaag aacacctgcg cttgcttgaa 240 

caatcagtta ctgttccgtc agccgcgacc gatgccataa gtgaggcggc aaaggaagcc 300 

aatatggtag tgtctattgg tgtcaatgaa cgagacggtg gtaccattta caatacgcaa 360 

ctcctttttg atgctgacgg aacattaatt cagcgcagac gtaaacttac accaacgtat 420 

catgaaagaa tgatttgggg acaaggtgac gcttcaggtc ttcgtgccac agacagcgct 480 

gttgggcgta tcgggcagtt ggcttgttgg gaacattaca atccattgtt ccgttatgct 540 

ttgattgctg atggagaaca aatccattct gccatgtatc ccggatcatt tttaggtgcg 600 

ttgcacggtg aacaaaccga aatcaatgta cgccaacacg ctttagaatc ggccagcttc 660 

gtcgtagtgg ctaccggttg gttggatgcc gatcaacaag cacaaattgc gaaagacacc 720 

ggtggaccaa tcggaccaat ttcgggaggt tgttttacag ccgttatagg ccctgacgga 780 

caactaatcg gggaagccct tacatcaggt gaaggggaag tgattgccga tattgatttg 840 

gcacaaattg atgcccgcaa aagattaatg gatgccagtg gtcactacaa ccgtcctgaa 900 

ttgttgagct tgcatatcga tcacactccg actgctccta tgcatgaaag agtagtttac 960 

actgagccgg gattagcaaa aagacaaaat gaaaattcat caaattaa 1008 



Page 26 



WO 03/097810 PCT/US03/15712 



<210> 40 
<211> 335 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 40 

Met Lys Asn lie Lys Asn Ser Glu Lys Ser Ser Thr Val Arg Val Ala 

15 10 15 

Ala Val Gin lie Ser Pro Val Leu Tyr Asn Arg Glu Ala Thr Val Gin 

20 25 30 

Lys Val Val Asn Lys lie Leu Glu Leu Gly Lys Gin Gly Val Gin Phe 

35 40 45 

Ala Thr Phe Pro Glu Thr He Val Pro Tyr Tyr Pro Tyr Phe Ser Phe 

50 55 60 

He Gin Ala Pro Tyr Ala Met Gly Lys Glu His Leu Arg Leu Leu Glu 
65 70 75 B0 

Gin Ser Val Thr Val Pro Ser Ala Ala Thr Asp Ala He Ser Glu Ala 

85 90 95 

Ala Lys Glu Ala Asn Met Val Val Ser He Gly Val Asn Glu Arg Asp 

100 105 HO 

Gly Gly Thr He Tyr Asn Thr Gin Leu Leu Phe Asp Ala Asp Gly Thr 

115 120 125 

Leu He Gin Arg Arg Arg Lys Leu Thr Pro Thr Tyr His Glu Arg Met 

130 135 140 

He Trp Gly Gin Gly Asp Ala Ser Gly Leu Arg Ala Thr Asp Ser Ala 
145 150 155 160 

Val Gly Arg He Gly Gin Leu Ala Cys Trp Glu His Tyr Asn Pro Leu 

165 170 175 

Phe Arg Tyr Ala Leu He Ala Asp Gly Glu Gin He His Ser Ala Met 

180 185 190 

Tyr Pro Gly Ser Phe Leu Gly Ala Leu His Gly Glu Gin Thr Glu He 

195 200 205 

Asn Val Arg Gin His Ala Leu Glu Ser Ala Ser Phe Val Val Val Ala 

210 215 220 

Thr Gly Trp Leu Asp Ala Asp Gin Gin Ala Gin He Ala Lys Asp Thr 
225 230 235 240 

Gly Gly Pro He Gly Pro He Ser Gly Gly Cys Phe Thr Ala Val He 

245 250 255 

Gly Pro Asp Gly Gin Leu He Gly Glu Ala Leu Thr Ser Gly Glu Gly 

260 265 270 

Glu Val He Ala Asp He Asp Leu Ala Gin He Asp Ma Arg Lys Arg 

275 280 285 

Leu Met Asp Ala Ser Gly His Tyr Asn Arg Pro Glu Leu Leu Ser Leu 

290 295 300 

His He Asp His Thr Pro Thr Ala Pro Met His Glu Arg Val Val Tyr 
305 310 315 320 

Thr Glu Pro Gly Leu Ala Lys Arg Gin Asn Glu Asn Ser Ser Asn 
325 330 335 

<210> 41 
<211> 966 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
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<400> 41 

atgcccaatg 

tttcttgatc 

gaaggagcgc 

tggggtattc 

aatgcggtca 

aatgcctacg 

tataatacgc 

gtgccaacgg 

tacgatacgc 

gcacgctatg 

ggccaaccct 

ggttgctgta 

taccatgctg 

ggacacttta 

ccacgtatgg 

gatgtgttcc 

tcataa 



agaataacat 
gtgaagcaac 
gcctgattgt 
cttccggtga 
ctattcccag 
tagtgatggg 
tgttgtacat 
gcggcgaacg 
cattgggaaa 
ctatgtatgc 
ggctttctac 
tcgcgatgcg 
aaatggatga 
tcgcagggcc 
tgcaaggccc 
agttgacggt 



cgctacattc 
catcgacaaa 
gtttccagaa 
gcaaggttta 
tgacgcgact 
aatgagcgaa 
caatgcgcag 
cctggtatgg 
actcggtggt 
ctgggggaca 
attacggcat 
aaaagacgat 
atggattaat 
tgtgcgcaag 
gaagtggatg 
gcatacgaat 



aaagttgccg 
gcttgcgggt 
gcgttcatcc 
ctcaatgaac 
gacaggctgt 
cggaatgtcg 
ggggagattt 
gcgcagggtg 
ctcatttgct 
caaatctatg 
atcgccaaag 
attccggata 
gttggcgaca 
caagaagaaa 
ctcgatgtgg 
gtgagagaga 



cagtccaggc 
tgattgccac 
caacctatcc 
tctatgcaga 
gcgaggccgc 
aggcgagtgg 
tagggaaaca 
atggcagcac 
gggaaaatta 
tcgcggcaac 
aaggcagggt 
gttactccat 
gtgtgattgt 
ttctctacgc 
cggggcatta 
tgatgcgggt 



cacaccggtg 
tgccggcaat 
tgaatgggtt 
gctgctcacc 
gcagcttgcg 
cgcaagccta 
tcgaaagctg 
gctgcaggtc 
tatgccgctg 
gtgggatcga 
atacgtgatt 
gaagcagaaa 
caatcccgaa 
ggagatcgat 
tgcgagacca 
ggaagatgat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
966 



<210> 42 
<211> 321 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 42 

Met Pro Asn Glu Asn 

1 5 
Ala Thr Pro Val Phe 
20 

Gly Leu He Ala Thr 
35 

Pro Glu Ala Phe He 
50 

Ser Gly Glu Gin Gly 
65 

Asn Ala Val Thr He 
85 

Ala Gin Leu Ala Asn 
100 

Val Glu Ala Ser Gly 
115 

Ala Gin Gly Glu He 
130 

Gly Glu Arg Leu Val 
145 

Tyr Asp Thr Pro Leu 
165 

Tyr Met Pro Leu Ala 
180 

Tyr Val Ala Ala Thr 
195 

Arg His He Ala Lys 
210 

Ala Met Arg Lys Asp 
225 

Tyr His Ala Glu Met 
245 

Val Asn Pro Glu Gly 



Asn He Ala Thr 
Leu Asp 
Ala Gly 



Pro Thr 

55 
Leu Leu 
70 

Pro Ser 

Ala Tyr 

Ala Ser 

Leu Gly 
135 
Trp Ala 
150 

Gly Lys 

Arg Tyr 

Trp Asp 

Glu Gly 
215 
Asp He 
230 

Asp Glu 



Arg Glu 
25 

Asn Glu 
40 

Tyr Pro 



Asn Glu 

Asp Ala 

Val Val 
105 
Leu Tyr 
120 

Lys His 

Gin Gly 

Leu Gly 

Ala Met 
185 
Arg Gly 
200 

Arg Val 



Pro Asp 
Trp He 
His Phe He Ala 



Phe Lys 
10 

Ala Thr 

Gly Ala 

Glu Trp 

Leu Tyr 

75 
Thr Asp 
90 

Met Gly 

Asn Thr 

Arg Lys 

Asp Gly 
155 
Gly Leu 
170 

Tyr Ala 

Gin Pro 

Tyr Val 

Ser Tyr 
235 
Asn Val 
250 

Gly Pro 



Val Ala 

He Asp 

Arg Leu 

45 
Val Trp 
60 

Ala Glu 

Arg Leu 

Met Ser 

Leu Leu 
125 
Leu Val 
140 

Ser Thr 

He Cys 

Trp Gly 

Trp Leu 
205 
He Gly 
220 

Ser Met 
Gly Asp 
Val Arg 



Ala Val Gin 
15 

Lys Ala Cys 
30 

He Val Phe 

Gly He Pro 

Leu Leu Thr 
80 

Cys Glu Ala 
95 

Glu Arg Asn 
110 

Tyr He Asn 

Pro Thr Gly 

Leu Gin Val 
160 

Trp Glu Asn 

175 
Thr Gin He 
190 

Ser Thr Leu 

Cys Cys He 

Lys Gin Lys 
240 

Ser Val He 

255 
Lys Gin Glu 



l 
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260 265 270 

Glu He Leu Tyr Ala Glu He Asp Pro Arg Met Val Gin Gly Pro Lys 

275 280 285 

Trp Met Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe Gin 

290 295 300 

Leu Thr Val His Thr Asn Val Arg Glu Met Met Arg Val Glu Asp Asp 
305 310 315 320 

Ser 



<210> 43 
<211> 993 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 43 

atgagagtcg ttaaagccgc ggcggtccaa ctgaaacctg tcctctatag ccgagaggga 60 

actgtcgaaa acgtcgtccg taaaatccac gagcttggac agcaaggggt acagttcgcg 120 

acgtttccag agactgtggt gccttactac ccgtactttt cgatcgtgca gtccggctat 180 

caaattcttg gcggtggtga gttcctgaag ctgcttgatc agtcagtaac cgtgccatct 240 

ctcgctacgg aagcgatcgg cgaggcttgc aggcaggcgg gcgtcgttgt ctccatcggc 300 

gtcaacgagc gtgatggagg aactctatac aacacgcaac ttctctttga tgccgacgga 360 

acattgattc aaagacgacg caagatcaca cccacccatt acgagcgcat ggtctggggc 420 

cagggcgatg gctcaggttt acgggccatt gacagcaagg tcgcgcgcat tggtcaactg 480 

gcgtgttttg agcactacaa ccctctcgca cgttacgcga tgatggccga tggcgagcag 540 

atccattctg cgatgttccc cggctccatg ttcggcgata atttttcaga gaaggtggaa 600 

atcaacataa ggcagcatgc aatggagtct gggtgctttg tcgtttgcgc tactgcctgg 660 

ttggatgctg accagcaggc tcaaatcatg aaagacacgg gatgtgagat cggaccgatc 720 

tcaggaggtt gcttcacagc gatcgcggca ccagatggaa gccttatagg tgaacccatc 780 

cgctcaggtg aaggcgtttg tattgccgac ctcgatttca aacttatcga caagcggaag 840 

cacgtagtag acacacgcgg ccattatagc cggccagaat tgctcagcct cctgattgat 900 

cggacgccga cggcccacat acacgaaagg accgagcaac cgagggcggc catcgagaaa 960 

gagtcgcagg atgttttcac cgctgttgct taa 993 

<210> 44 

<211> 330 

<212> PRT 

<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 44 



Met Arg 


Val 


Val 


Lys 


Ala 


Ala 


Ala 


Val 


Gin 


Leu Lys 


Pro Val Leu 


Tyr 


1 






5 










10 




15 




Ser Arg 


Glu 


Gly 


Thr 


Val 


Glu 


Asn 


Val 


Val 


Arg Lys 


He His Glu 


Leu 




20 










25 






30 




Gly Gin 


Gin 


Gly 


Val 


Gin 


Phe 


Ala 


Thr 


Phe 


Pro Glu 


Thr Val Val 


Pro 


35 








40 








45 




Tyr Tyr 


Pro 


Tyr 


Phe 


Ser 


He 


Val 


Gin 


Ser 


Gly Tyr 


Gin He Leu 


Gly 


50 










55 








60 






Gly Gly 


Glu 


Phe 


Leu 


Lys 


Leu 


Leu 


Asp 


Gin 


Ser Val 


Thr Val Pro 


Ser 


65 








70 










75 




80 


Leu Ala 


Thr 


Glu 


Ala 


He 


Gly 


Glu 


Ala 


Cys 


Arg Gin Ala Gly Val Val 








85 








90 




95 




Val Ser 


He 


Gly 


Val 


Asn 


Glu 


Arg 


Asp 


Gly 


Gly Thr 


Leu Tyr Asn 


Thr 






100 










105 






110 




Gin Leu 


Leu 


Phe 


Asp 


Ala 


Asp 


Gly 


Thr 


Leu 


He Gin 


Arg Arg Arg 


Lys 
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115 










120 


He 


Thr 


Pro 


Thr 


His 


Tyr Glu Arg 




130 










135 




Ser Gly 


Leu Arg Ala 


Tie* 


Asp 


Ser 


145 










150 






Ala 


Cys 


Phe 


Glu 


His 
165 


Tyr 


Asn 


Pro 


Asp 


Gly 


Glu 


Gin 
180 


He 


His 


Ser 


Ala 


Asp 


Asn 


Phe 
195 


Ser 


Glu 


Lys 


Val 


Glu 
200 


Glu 


Ser 


Gly Cys 


Phe 


Val 


Val 


Cys 




210 










215 




Gin 


Gin 


Ala 


Gin 


He 


Met 


Ly3 Asp 


225 










230 






Ser Gly 


Gly 


Cys 


Phe 


Thr 


Ala 


He 










245 








Gly Glu 


Pro 


He 


Arg 


Ser Gly Glu 








260 










Phe 


Lys 


Leu 


He Asp 


Lys 


Arg 


Lys 






275 










280 


Tyr 


Ser 
290 


Arg 


Pro 


Glu 


Leu 


Leu 
295 


Ser 


Ala 


His 


He 


His 


Glu 


Arg 


Thr 


Glu 


305 










310 






Glu 


Ser 


Gin Asp Val 


Phe 


Thr 


Ala 










325 









125 



Met 


Val Trp Gly Gin 


Gly Asp Gly 




140 








Lys 


Val Ala Arg He 


Gly Gin 


Leu 




155 






160 


Leu 


Ala Arg Tyr Ala 


Met 


Met 


Ala 




170 




175 




Met 


Phe Pro Gly Ser 


Met 


Phe 


Gly 


185 




190 






He 


Asn He Arg Gin 


His 


Ala 


Met 




205 








Ala 


Thr Ala Trp Leu 


Asp 


Ala 


Asp 




220 








Thr 


Gly Cys Glu He 


Gly 


Pro 


He 




235 






240 


Ala 


Ala Pro Asp Gly 


Ser 


Leu 


He 




250 




255 




Gly 


Val Cys He Ala 


Asp 


Leu 


Asp 


265 




270 






His 


Val Val Asp Thr 


Arg Gly His 




285 








Leu 


Leu He Asp Arg 


Thr 


Pro 


Thr 




300 








Gin 


Pro Arg Ala Ala 


He 


Glu 


Lys 




315 






320 


Val 


Ala 









330 



<210> 45 
<211> 996 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 45 

gtgaaaccgc cgacgtcatt ccgcgtggcc gccgttcagg cctgtcccgt ctacctcgat 60 

cgcgacctga cgatcggcaa ggcagaaggg ttgatcgccg aggcggctgg aaacggcgcg 120 

aacctcatcg tgttccccga agcgttcgtg cctggctatc cggtgtgggt gtggttcatc 180 

ccgcccggtc gcacggcgga tcttcgcgaa gcgtacagcg tcctccacgc caactcgatc 240 

gcggtcccca gccagtcgac cgagcgtctg tgcgcggcgg cgcggcgcgc tggcgtcgcc 300 

gtggcgattg gtgtcaacga aagaaacagc gaggccagcg gcggcagcct cttcaacacg 360 

ctgctgtaca tcggaccgga cggcacgctg ctcggtaaac accgaaagct ggtgccaaca 420 

ggcggagagc gtcttgtctg ggccagcggc gacggcagcg accttgccgt gttcacactg 480 

cctttcgcgc gagtcggcgg actgatctgc tgggagaact acatgccgct cgcccgctac 540 

gcgctggcgg cctggggtgc gcaaatccac gtggcgccga cctgggaccg cggcgagccg 600 

tggctctcaa cactgcgtca tgtcgcgaag gaaggtagag ccgtgacgat cggctgctgt 660 

caggccgtcc gcaaggaaga cattccggac gggctggcat tcaagtcccg atacctggcc 720 

gacgtgggcg cctgggtcaa cccaggcggg agcgtcatcg tcgatcccga cggaaaaatt 780 

cttgccggac ctgcgaacga aaccgaaggc atcttgtacg ctgacatcag ggccgatcag 840 

ctcgtcgggc cgagatggca actcgacatt gccggacact acgcgcggcc ggacgtcttc 900 

gagctgatcg tgcatcggcg ttcgacgccg atgattcgcg aggtctcggc gcctcgtcgt 960 

cgcgcaagaa cgggaaagcg accgcgacgc cgctga 996 

<210> 46 
<211> 331 
<212> PRT 
<213> Unknown 



<220> 



Page 30 



WO 03/097810 PCT/US03/15712 



<223> Obtained from an environmental sample 
<400> 46 

Val Lvs Pro Pro Thr Ser Phe Arg Val Ala Ala Val Gin Ala Cys Pro 

I 5 10 15 

Val Tyr Leu Asp Arg Asp Leu Thr lie Gly Lys Ala Glu Gly Leu lie 

20 25 30 

Ala Glu Ala Ala Gly Asn Gly Ala Asn Leu He Val Phe Pro Glu Ala 

35 40 45 

Phe Val Pro Gly Tyr Pro Val Trp Val Trp Phe He Pro Pro Gly Arg 

50 55 60 

Thr Ala Asp Leu Arg Glu Ala Tyr Ser Val Leu His Ala Asn Ser He 
65 70 *75 80 

Ala Val Pro Ser Gin Ser Thr Glu Arg Leu Cys Ala Ala Ala Arg Arg 

85 90 95 

Ala Gly Val Ala Val Ala He Gly Val Asn Glu Arg Asn Ser Glu Ala 

100 105 HO 

Ser Gly Gly Ser Leu Phe Asn Thr Leu Leu Tyr He Gly Pro Asp Gly 

115 120 125 

Thr Leu Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly Gly Glu Arg 

130 135 140 

Leu Val Trp Ala Ser Gly Asp Gly Ser Asp Leu Ala Val Phe Thr Leu 
145 150 155 160 

Pro Phe Ala Arg Val Gly Gly Leu He Cys Trp Glu Asn Tyr Met Pro 

165 170 175 

Leu Ala Arg Tyr Ala Leu Ala Ala Trp Gly Ala Gin He His Val Ala 

180 185 190 

Pro Thr Trp Asp Arg Gly Glu Pro Trp Leu Ser Thr Leu Arg His Val 

195 200 205 

Ala Lys Glu Gly Arg Ala Val Thr He Gly Cys Cys Gin Ala Val Arg 

210 215 220 

Lvs Glu Asp He Pro Asp Gly Leu Ala Phe Lys Ser Arg Tyr Leu Ala 
225 230 235 240 

Asp Val Gly Ala Trp Val Asn Pro Gly Gly Ser Val He Val Asp Pro 

245 250 255 

Asp Gly Lys He Leu Ala Gly Pro Ala Asn Glu Thr Glu Gly He Leu 

260 265 270 

Tyr Ala Asp He Arg Ala Asp Gin Leu Val Gly Pro Arg Trp Gin Leu 

275 280 285 

Asp He Ala Gly His Tyr Ala Arg Pro Asp Val Phe Glu Leu He Val 

290 295 300 

His Arg Arg Ser Thr Pro Met He Arg Glu Val Ser Ala Pro Arg Arg 
305 310 315 320 

Arg Ala Arg Thr Gly Lys Arg Pro Arg Arg Arg 
325 330 

<210> 47 
<211> 1014 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 47 

gtgaaagaag caatcaaagt agcctgtgtg caagcagctc cagtctttct cgacctggac 
gccacagtgg acaagaccgt cgccctgatt gaggaggcag cccgtaacgg cgcacgccta 
atcgcctttc cagagacctg gattccaggc tacccatggt tcctttggct ggactcacca 
gcctggggga tgcaattcgt gcgccgatac cacgagaact cactggtcct cgacagccct 
caggccaagc gcatcagtga ggccgcccag cgcgccggta tatacgtcgc gctagggtac 
agcgaacgcg tgagcggaac cctctacatg gggcagtggc tcattgacga taagggcgaa 



60 
120 
180 
240 
300 
360 



Page 31 



WO 03/097810 



PCT/US03/15712 



acagctgggc tgcgccgaaa gctgaaacca acccatgtag agcgaaccct cttcggtgaa 420 

ggcgacggat catccctttc cactttcgac acaccgttgg gggtgctggg cggactctgc 480 

tgttgggaac acttacaacc tctttcgaaa tatgcgctct acgcacagaa cgaggaaata 540 

cacttcgccg cctggcctag cttcagcatc taccgtcaag cgacagaagt ccttggacca 600 

gaagtaaatg tcgcagcttc tcggatctac gccgtggaag ggcagtgttt tgttctcgct 660 

tcctgcgcgc tcgtctcgcc agagatgatc gaaatgctct gcactgacga aagcaagcac 720 

agccttcttc aggccggcgg cgggtactcc cgcattatcg gtcccgatgg cagcgaccta 780 

gcgcgcccct tgggcgaaaa cgaggaaggt attctctatg ccactctgga ccctgccgct 840 

cgaatctatg caaagaccgc agctgatcca gccgggcact actccagacc agacgtcact 900 

cggctgctga tcaatcgcag tgccaatcag ccagtcgtag aggttggaag ggaaatacct 960 

gcatcggccc aaggctttga agttgaggcg gcccccgggt acgaaggcga ttga 1014 



<210> 48 
<211> 337 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 48 



Val 


Lys 


Glu 


Ala 


He 


Lys 


Val 


Ala 


Cys Val 


Gin 


Ala 


Ala 


Pro 


Val 


Phe 


1 








5 








10 










15 




Leu 


Asp 


Leu 


Asp 


Ala 


Thr 


Val 


Asp 


Lys Thr 


Val 


Ala 


Leu 


He 


Glu 


Glu 








20 










25 








30 






Ala 


Ala 


Arg 


Asn 


Gly 


Ala 


Arg 


Leu 


He Ala 


Phe 


Pro 


Glu 


Thr 


Trp 


He 






35 










40 








45 








Pro 


Gly 


Tyr 


Pro 


Trp 


Phe 


Leu 


Trp 


Leu Asp 


Ser 


Pro 


Ala 


Trp 


Gly 


Met 




50 










55 








60 










Gin 


Phe 


Val 


Arg 


Arg 


Tyr 


His 


Glu 


Asn Ser 


Leu 


Val 


Leu 


Asp 


Ser 


Pro 


65 










70 








75 










80 


Gin 


Ala 


Lys 


Arg 


He 


Ser 


Glu 


Ala 


Ala Gin 


Arg 


Ala 


Gly 


He 


Tyr 


Val 










85 








90 










95 




Ala 


Leu 


Gly 


Tyr 


Ser 


Glu 


Arg 


Val 


Ser Gly 


Thr 


Leu 


Tyr 


Met 


Gly 


Gin 








100 










105 








110 






Trp 


Leu 


He 


Asp 


Asp 


Lys 


Gly 


Glu 


Thr Ala 


Gly 


Leu 


Arg 


Arg 


Lys 


Leu 






115 










120 








125 








Lys 


Pro 


Thr 


His 


Val 


Glu 


Arg 


Thr 


Leu Phe 


Gly 


Glu 


Gly 


Asp 


Gly 


Ser 




130 










135 








140 










Ser 


Leu 


Ser 


Thr 


Phe 


Asp 


Thr 


Pro 


Leu Gly 


Val 


Leu 


Gly 


Gly 


Leu 


Cys 


145 










150 








155 










160 


Cys 


Trp 


Glu 


His 


Leu 


Gin 


Pro 


Leu 


Ser Lys 


Tyr 


Ala 


Leu 


Tyr 


Ala 


Gin 










165 








170 










175 




Asn 


Glu 


Glu 


He 


His 


Phe 


Ala 


Ala 


Trp Pro 


Ser 


Phe 


Ser 


He 


Tyr 


Arg 








180 










185 








190 






Gin 


Ala 


Thr 


Glu 


Val 


Leu 


Gly 


Pro 


Glu Val 


Asn 


Val 


Ala 


Ala 


Ser 


Arg 






195 










200 








205 








lie 


Tyr 


Ala 


Val 


Glu 


Gly 


Gin 


Cys 


Phe Val 


Leu 


Ala 


Ser 


Cys 


Ala 


Leu 




210 










215 








220 










Val 


Ser 


Pro 


Glu 


Met 


He 


Glu 


Met 


Leu Cys 


Thr 


Asp 


Glu 


Ser 


Lys 


His 


225 










230 








235 










240 


Ser 


Leu 


Leu 


Gin 


Ala 


Gly 


Gly 


Gly 


Tyr Ser 


Arg 


He 


He 


Gly 


Pro 


Asp 










245 








250 










255 




Gly 


Ser 


Asp 


Leu 


Ala 


Arg 


Pro 


Leu 


Gly Glu 


Asn 


Glu 


Glu 


Gly 


He 


Leu 








260 










265 








270 






Tyr 


Ala 


Thr 


Leu 


Asp 


Pro 


Ala 


Ala 


Arg He 


Tyr 


Ala 


Lys 


Thr 


Ala 


Ala 






275 










280 








285 








Asp 


Pro 


Ala 


Gly 


His 


Tyr 


Ser 


Arg 


Pro Asp 


Val 


Thr 


Arg 


Leu 


Leu 


He 




290 










295 








300 










Asn 


Arg 


Ser 


Ala 


Asn 


Gin 


Pro 


Val 


Val Glu 


Val 


Gly 


Arg 


Glu 


He 


Pro 


305 










310 








315 










320 
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Ala Ser Ala Gin Gly Phe Glu Val Glu Ala Ala Pro Gly Tyr Glu Gly 
325 330 335 

Asp 



<210> 49 
<211> 1038 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 49 

atgaacaaag tcgtggctgc tgccgttcag tgcagcccgg tgctttactc ttgcgccgga 60 

actgtaaata aaatttgcga gtggattgca gatttgggca aacaaggggt tgagctggcg 120 

gtgttcgcgg aaaccctggt gccttactac ccgtattttt cttttatcca ggctccttgt 180 

gcgatgggcg cgcaacattt gttgttgatg caagaatcag tagaggttcc ttccatctac 240 

acgcaacaaa ttgccgctgc agcaaaagca gcgaagatgg tggtgtcagt tggtattaac 300 

gaacgcgacg gcggttctat ttataacgcg caattattat ttgatgcggg cggtcagctt 3 60 

gttcagcacc gccgaaaaat tacgccgaca tttcatgagc gcatggtgtg ggggcagggc 420 

gatggctccg gtttgtgcgc agtggatacg gcagttggtc gtgttggttc gctcgcttgc 480 

tgggaacatt acaacccact cgcgcgttac gcattgatgg cagatcgcga acaaattcac 540 

gtgagtatgt ttcccggttc tttggtcggc gaaatttttg ccgagcaaat tgaagcaact 600 

attcgtcacc acgcattgga gtccggttgc tttgtggtaa atgcgacggg ctggttaacg 660 

ccggaacagc aagctcaaat cgtaaaagat actggtggtc ctatcgctgc cattagcggt 720 

ggttgtttca ccgccattgt ttcaccggaa ggaaaattgc tcggcacgcc attgcgcagt 780 

gattccgggg agggtgcctg tatcgccgaa ctggatttta atctcatcaa taagcgtaag 840 

cgcatgatgg attctgtcgg ccattacagt cgtcctgaat tgctcagttt gctgattgat 900 

aaaacaccga caagtcatac acatccgctt aaaaaacctt tggctcccag tgaaaaaaat 960 

acgccagagg atatcgccac tggtttaaca ctggtcactc ccgtttcaaa tgcaaacctt 1020 

ttcagcgcaa gcaactag 1038 

<210> 50 

<211> 345 

<212> PRT 

<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 50 



Met 


Asn 


Lys 


Val 


Val 


Ala 


Ala 


Ala 


Val 


Gin 


Cys 


Ser 


Pro 


Val 


Leu 


Tyr 


1 








5 










10 










15 




Ser 


Cys 


Ala 


Gly 
20 


Thr 


Val 


Asn 


Lys 


He 
25 


Cys 


Glu 


Trp 


He 


Ala 
30 


Asp 


Leu 


Gly 


Lys 


Gin 
35 


Gly 


Val 


Glu 


Leu 


Ala 
40 


Val 


Phe 


Ala 


Glu 


Thr 
45 


Leu 


Val 


Pro 


Tyr 


Tyr 
50 


Pro 


Tyr 


Phe 


Ser 


Phe 
55 


lie 


Gin 


Ala 


Pro 


Cys 
60 


Ala 


Met 


Gly 


Ala 


Gin 


His 


Leu 


Leu 


Leu 


Met 


Gin 


Glu 


Ser 


Val 


Glu 


Val 


Pro 


Ser 


He 


Tyr 


65 










70 










75 










80 


Thr 


Gin 


Gin 


lie 


Ala 
85 


Ala 


Ala 


Ala 


Lys 


Ala 
90 


Ala 


Lys 


Met 


Val 


Val 
95 


Ser 


Val 


Gly 


lie 


Asn 
100 


Glu 


Arg 


Asp 


Gly 


Gly 
105 


Ser 


He 


Tyr 


Asn 


Ala 
110 


Gin 


Leu 


Leu 


Phe 


Asp 
115 


Ala 


Gly 


Gly 


Gin 


Leu 
120 


Val 


Gin 


His 


Arg 


Arg 
125 


Lys 


He 


Thr 


Pro 


Thr 
130 


Phe 


His 


Glu 


Arg 


Met 
135 


Val 


Trp 


Gly 


Gin 


Gly 
140 


Asp 


Gly 


Ser 


Gly 


Leu 


Cys 


Ala 


Val 


Asp 


Thr 


Ala 


Val 


Gly 


Arg 


Val 


Gly 


Ser 


Leu 


Ala 


Cys 
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145 








150 






155 160 


Trp 


Glu 


His 


Tyr Asn 


Pro 


T on Al s A vn 


lyr 


Ala T.eni Mpt" Ala Ast) Aro 
ni.a iicu ncL ma y 






165 






170 


175 


Glu 


Gin 


lie 


His Val 


Ser 


Mnf DKo DrA 

riei. trnts no 


oiy 


Cot* T.on Val ftl v Rill Tip 








180 




1 A** 




190 


Phe 


Ala 


Glu 


Gin lie 


Glu 


H-La inr lie 


Arg 


u{ o u{ o Hist T.oti f^l n Qpr 
HIS HI is n±a belt uxu oci 






i yo 






9nn 




205 


Gly Cys 


Pne 


Val Val 


Asn 


A±a xnr v»iy 


lrp 


bcu Ifll titJ oiu UAii uj.11 




210 








91 ^ 
£lO 






Ala 


Gin 


lie 


Val Lys Asp 


inr Lx-iy uiy 


Pro 


Ti D Din Ala Tip Qpr Hlv 
lie hi a nia lie uci wiy 


225 








230 






9**H 940 


Gly Cys 


Pne 


Thr Ala 


lie 


Vol del flU 


oiu 


vjxy iiys iicu uc ux ujiy aha. 








245 








255 


Pro 


Leu 


Arg 


Ser Asp 


Ser 


oJ.y i?iu o±y 


TV 1 -s 


vys lie nla V3JLU ueu 






260 




265 




270 


Phe 


Asn 


Leu 


lie Asn 


Lys 


Arg Lys Arg 


Met 


Met Asp Ser Val Gly His 






275 






280 




285 


Tyr 


Ser 


Arg 


Pro Glu 


Leu 


Leu Ser Leu 


Leu 


He Asp Lys Thr Pro Thr 


290 








295 




300 


Ser 


His 


Thr 


His Pro 


Leu 


Lys Lys Pro 


Leu 


Ala Pro Ser Glu Lys Asn 


305 








310 






315 320 


Thr 


Pro 


Glu 


Asp lie 


Ala 


Thr Gly Leu 


Thr 


Leu Val Thr Pro Val Ser 








325 






330 


335 


Asn 


Ala 


Asn 


Leu Phe 


Ser 


Ala Ser Asn 












340 




345 







<210> 51 
<211> 897 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 51 

gtgaaegtec gcgtcgcggt ggtgcaggcc acgccggccg tgetcgaegg gccggcgtcg 60 

gtgeggaagg cctgccgcct gateggegag gccgciggccg gcggcgcccg cctgatcgcc 120 

ctgcccgagg gcttcgtgcc catcatgccg cgctcctgct gggggcacca cttcgcgctg 180 

atcgcctcgc egaagtegge ggccctgcac cggcgcatct gggagaaege cgtcgacgtc 240 

ggcggcccgc tggcccgcga gctcggcgac gccgcgcgcc gcgcggacgc ctgggtggcc 300 

atcggggtga aegagegega cgcccgccgg ccgggcacgc tctggaacac getgetctgg 360 

ttcgcgcccg aegggagect ggcccggcgc caccgcaagc tcgtgcccac catgeacgag 420 

cgcacgttct gggggcaggg cgcgggcgac gacctcgagg cgctggccgc ggacttegge 480 

cgcctgggcg gectgatctg ctgggagaac ttcatgcccg ccgcgcgccg gcgcctgcac 540 

egggaegggg tcgacttcta cctggccccc aeggeggacg acegggacat ctgggtcgcc 600 

gegatgegea cgttcgcctt cgaggccggc gccttcgtcc tctcgccggt gcagtacctg 660 

cggaccgccg acttcccgga ggacttcccg ctgegegagg agctcgccga ctgccccgag 720 

gtccagttca ceggggggag cgtgatctgc gacccgtggg gcaacctcct ggcggggccg 780 

gtccacgggg gcgaggagat cctctacgcc gaetgegate tcgacctcgt cctcgaggcc 84 0 

cgacgggtgc tcgacacggc cggccactac gaccgcccgg acctcgcctc ggcctga 897 

<210> 52 
<211> 298 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 52 

Val Asn Val Arg Val Ala Val Val Gin Ala Thr Pro Ala Val Leu Asp 
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1 5 10 15 

Gly Pro Ala Ser Val Arg Lys Ala Cys Arg Leu lie Gly Glu Ala Ala 
J 30 



20 

Ala Gly Gly Ala Arg Leu lie Ala Leu Pro Glu Gly Phe Val Pro lie 

35 40 45 

Met Pro Arg Ser Cys Trp Gly His His Phe Ala Leu He Ala Ser Pro 

50 $5 60 

Lys Ser Ala Ala Leu His Arg Arg lie Trp Glu Asn Ala Val Asp Val 

65 7 0 75 

Gly Gly Pro Leu Ala Arg Glu Leu Gly Asp Ala Ala Arg Arg Ala Asp 

85 90 95 

Ala Trp Val Ala He Gly Val Asn Glu Arg Asp Ala Arg Arg Pro Gly 

100 105 
Thr Leu Trp Asn Thr Leu Leu Trp Phe Ala Pro Asp Gly Ser Leu Ala 

115 120 125 

Arc Arg His Arg Lys Leu Val Pro Thr Met His Glu Arg Thr Phe Trp 

135 I 40 
Asp Leu Glu Ala 

Cys Trp Glu Asn 
165 170 



130 

Gly Gin Gly Ala Gly Asp Asp Leu Glu Ala Leu Ala Ala Asp Phe Gly 
145 150 155 

Arg Leu Gly Gly Leu lie Cys Trp Glu Asn Phe Met Pro Ala Ala Arg 

165 no 175 

Arg Arg Leu His Arg Asp Gly Val Asp Phe Tyr Leu Ala Pro Thr Ala 



180 j_ v 

Asp Asp Arg Asp He Trp Val Ala Ala Met Arg Thr Phe Ala Phe Glu 

r onn 205 



185 19° 



Ala Gly Ala Phe Val Leu Ser Pro Val Gin Tyr Leu Arg Thr Ala Asp 



200 205 



215 220 



210 

Phe Pro Glu Asp Phe Pro Leu Arg Glu Glu Leu Ala Asp Cys Pro Glu 
225 230 235 *™ 

Val Gin Phe Thr Gly Gly Ser Val He Cys Asp Pro Trp Gly Asn Leu 

245 250 ^ 

Leu Ala Gly Pro Val His Gly Gly Glu Glu lie Leu Tyr Ala Asp Cys 

260 265 270 

Asp Leu Asp Leu Val Leu Glu Ala Arg Arg Val Leu Asp Thr Ala Gly 

275 280 285 

His Tyr Asp Arg Pro Asp Leu Ala Ser Ala 
290 295 

<210> 53 
<211> 954 
<212> DMA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 

aJggJcaatt caaagagtca gattaaaatc gcggcggtgc aggctgcccc Wtctttctg 60 
gaa^gggagg cgacgattga caaagcttgc oggWttg WW™ ^tgggcc 1*0 
gcgaatctgg tggtcttccc tgagtcattc gtcccggctt atcccgatcg ^ctgggc 
gt?ccggcag gtgaaacaac gctcctgaac acgctctatg ccgaactgct Wccaatgcc 240 
attaaaattc cgggtccggc gacagagcgg ctgagccagg cagccaacct ggccggggtt 300 
Sgtcgcc-a t?ggcttgac cgagcggaac atcgaggcca |tggggcgag cctgtacaat 60 
actttactct ttctcgactc agccggcggc atgttaggca agcatcgcaa actgatcccc 
accaaScg agcgcctggt ctgggctcag ggtgatggca gcactctggc ggtgtacgag 480 
actS?tg Saaaatggg agigttgatt tgctgggaga attacatgcc cctggcccgt 540 
SS? atgcctgggg gaogcagatt tacatcgcgg ccacctggga tcgaggcgag 600 
ccgtggctgt caacgctgcg gcatatcgoo gcggaaggcc gggttgttgt cgtcggctgt «»u 
ggca?ggccc tgcgcaaagc cgacctgccc g-cgctttg -ctcaagca 9£«ttttac 720 
cagaaogccg atgagtggat caatgtcggc gacagcgcga "gttaaccc |J«*«JJ 
ttcatcgccg ggccgctgcg cgagoaggaa ggcatcctct atgctgagat tgatctggcc 
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cagatgcgcg gccccaaatg gatgctcgac gtggccggcc attacgctcg cccggatgtg 900 
tttgaactca tcgttcatcg ggaggcgcgg cccatgattg cgctaatttc atga 954 

<210> 54 
<211> 317 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 54 

Met Val Asn Ser Lys Ser Gin lie Lys lie Ala Ala Val Gin Ala Ala 

15 10 15 

Pro Val Phe Leu Glu Arg Glu Ala Thr lie Asp Lys Ala Cys Arg Leu 

20 25 30 

lie Ala Glu Ala Gly Glu Gin Gly Ala Asn Leu Val Val Phe Pro Glu 

35 40 45 

Ser Phe Val Pro Ala Tyr Pro Asp Trp Val Trp Ala Val Pro Ala Gly 

50 55 60 

Glu Thr Thr Leu Leu Asn Thr Leu Tyr Ala Glu Leu Leu Ala Asn Ala 
65 70 75 80 

Val Glu lie Pro Gly Pro Ala Thr Glu Arg Leu Ser Gin Ala Ala Asn 

85 90 95 

Leu Ala Gly Val Tyr Val Ala lie Gly Leu Thr Glu Arg Asn He Glu 

100 105 110 

Ala Ser Gly Ala Ser Leu Tyr Asn Thr Leu Leu Phe Leu Asp Ser Ala 

115 120 125 

Gly Gly Met Leu Gly Lys His Arg Lys Leu He Pro Thr Gly Gly Glu 

130 135 140 

Arg Leu Val Trp Ala Gin Gly Asp Gly Ser Thr Leu Ala Val Tyr Glu 
145 150 155 160 

Thr Arg Phe Gly Lys Met Gly Gly Leu He Cys Trp Glu Asn Tyr Met 

165 170 175 

Pro Leu Ala Arg Tyr Ala Leu Tyr Ala Trp Gly Thr Gin He Tyr He 

180 185 190 

Ala Ala Thr Trp Asp Arg Gly Glu Pro Trp Leu Ser Thr Leu Arg His 

195 200 205 

He Ala Ala Glu Gly Arg Val Val Val Val Gly Cys Gly Met Ala Leu 

210 215 220 

Arg Lys Ala Asp Leu Pro Asp Arg Phe Glu Leu Lys Gin Arg Phe Tyr 
225 230 235 240 

Gin Asn Ala Asp Glu Trp He Asn Val Gly Asp Ser Ala He Val Asn 

245 250 255 

Pro Asp Gly Glu Phe He Ala Gly Pro Leu Arg Glu Gin Glu Gly He 

260 265 270 

Leu Tyr Ala Glu He Asp Leu Ala Gin Met Arg Gly Pro Lys Trp Met 

275 280 285 

Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe Glu Leu He 

290 295 300 

Val His Arg Glu Ala Arg Pro Met He Ala Leu He Ser 
305 310 315 

<210> 55 
<211> 1017 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
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<400> 55 

atgggtatcg aacatccgaa gtacaaggtc gccgtggtgc aggcagctcc cgcctggctc 60 

gatctcgacg cgtcgatcga caagtcgatc gcgctgatcg aggaggcggc ccaaaaaggc 120 

gccaagctga tcgcattccc cgaggccttc attcccggct atccctggca catctggatg 180 

gactcgccgg cctgggcgat cggccgcggc tttgtgcagc gctattttga caattcgctc 240 

gcctatgaca gcccgcaggc cgagaagctg cgcgcggcgg tgcgcaaggc aaagctcacc 300 

gccgtgctcg ggctgtccga gcgcgacggc ggcagtctct atctggcgca atggttgatc 360 

gggcccgatg gcgagaccat cgcaaaacgc cgcaagctgc ggccgacaca tgccgagcgc 420 

acggtgtacg gcgagggcga cggcagcgat ctcgcagtcc acaaccgtcc cgatatcggc 480 

cgcctcggcg cgctctgctg ctgggagcat ttgcagccac tgtcgaaata cgcgatgtac 540 

gcgcagaacg agcaggtgca tgtcgcggcc tggccgagct tttcgctcta cgatcccttt 600 

gcggtggcgc tcggcgccga ggtgaacaac gcggcctcgc gcgtctatgc agtcgaaggc 660 

tcctgcttcg tgctggcgcc atgcgccacc gtctcgcagg ccatgatcga cgagctctgc 720 

gaccgaccgg acaagcatac gctgctgcat gtcggcggcg gttttgccgc gatctatggt 780 

cctgacggca gccagatcgg cgacaagctc gcgcccgacc aggaagggct gttgatcgcg 840 

gagatcgacc ttggggccat tggcgtcgcc aagaacgcgg ccgatcccgc cgggcattat 900 

tcgcggcccg acgtgacgcg gctcctgctc aacaagaaac cgtacaagcg cgtcgagcag 960 

ttctcgccac cggccgaggc ggtcgagccc acagatatcg cagcggcggc aagctga 1017 

<210> 56 
<211> 338 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 56 



Met 


Gly 


He 


Glu 


His 


Pro 


Lys 


Tyr 


Lys 


Val 


Ala Val 


Val 


Gin 


Ala 


Ala 


1 








5 










10 








15 




Pro 


Ala 


Trp 


Leu 


Asp 


Leu 


Asp 


Ala 


Ser 


He Asp Lys 


Ser 


He 


Ala 


Leu 








20 










25 








30 






lie 


Glu 


Glu 


Ala 


Ala 


Gin 


Lys 


Gly 


Ala Lys 


Leu He 


Ala 


Phe 


Pro 


Glu 






35 










40 








45 








Ala 


Phe 


He 


Pro 


Gly 


Tyr 


Pro 


Trp 


His 


He Trp Met 


Asp 


Ser 


Pro 


Ala 




50 










55 








60 










Trp 


Ala 


He 


Gly 


Arg 


Gly 


Phe 


Val 


Gin Arg Tyr Phe 


Asp 


Asn 


Ser 


Leu 


65 










70 










75 








80 


Ala 


Tyr 


Asp 


Ser 


Pro 


Gin 


Ala 


Glu 


Lys Leu Arg Ala 


Ala 


Val 


Arg 


Lys 










85 










90 








95 




Ala 


Lys 


Leu 


Thr 


Ala 


Val 


Leu 


Gly 


Leu 


Ser 


Glu Arg 


Asp 


Gly 


Gly 


Ser 








100 










105 








110 






Leu 


Tyr 


Leu 


Ala 


Gin 


Trp 


Leu 


He 


Gly Pro Asp Gly 


Glu 


Thr 


He 


Ala 




115 










120 








125 








Lys 


Arg 


Arg 


Lys 


Leu 


Arg 


Pro 


Thr 


His Ala Glu Arg 


Thr 


Val 


Tyr 


Gly 




130 










135 








140 










Glu 


Gly 


Asp 


Gly 


Ser 


Asp 


Leu 


Ala 


Val 


His 


Asn Arg 


Pro 


Asp 


He 


Gly 


145 










150 










155 








160 


Arg 


Leu 


Gly 


Ala 


Leu 


Cys 


Cys 


Trp 


Glu 


His 


Leu Gin 


Pro 


Leu 


Ser 


Lys 








165 










170 








175 




Tyr 


Ala 


Met 


Tyr 


Ala 


Gin 


Asn 


Glu 


Gin 


Val 


His Val 


Ala 


Ala 


Trp 


Pro 






180 










185 








190 






Ser 


Phe 


Ser 


Leu 


Tyr 


Asp 


Pro 


Phe 


Ala 


Val 


Ala Leu 


Gly 


Ala 


Glu 


Val 






195 










200 








205 








Asn 


Asn 


Ala 


Ala 


Ser 


Arg 


Val 


Tyr 


Ala 


Val 


Glu Gly 


Ser 


Cys 


Phe 


Val 




210 










215 








220 










Leu 


Ala 


Pro 


Cys 


Ala 


Thr 


Val 


Ser 


Gin 


Ala 


Met He 


Asp 


Glu 


Leu 


Cys 


225 










230 










235 








240 


Asp 


Arg 


Pro 


Asp 


Lys 


His 


Thr 


Leu 


Leu 


His 


Val Gly 


Gly 


Gly 


Phe 


Ala 








245 










250 








255 




Ala 


lie 


Tyr 


Gly 


Pro 


Asp 


Gly 


Ser 


Gin He Gly Asp 


Lys 


Leu 


Ala 


Pro 
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260 




Asp 


Gin Glu 


Gly 


Leu 




275 






Val Ala Lys 


Asn 


Ala 




290 






Val 


Thr Arg 


Leu 


Leu 


305 








Phe 


Ser Pro 


Pro 


Ala 








325 


Ala 


Ser 







265 

Leu lie Ala Glu lie Asp 
280 

Ala Asp Pro Ala Gly His 
295 

Leu Asn Lys Lys Pro Tyr 
310 315 
Glu Ala Val Glu Pro Thr 
330 





270 






Leu Gly Ala 


He 


Gly 




285 






Tyr 


Ser Arg 


Pro 


Asp 


300 








Lys 


Arg Val 


Glu 


Gin 








320 


Asp 


He Ala 


Ala 


Ala 




335 





<210> 57 
<211> 1014 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 

Jtgaaagaag caatcaaagt agcctgtgtg caagcagctc cagtctttct cgacctggac 60 
gccacagtgg acaagaccgt cgccctgatt gaggaggcag cccgtaacgg cgcacgccta 120 
atcgcctttc cagagacctg gattccaggc tacccatggt tcctttggct ggactcacca 180 
gcctggggga tgcaattcgt gcgccgatac cacgagaact cactggtcct cgacagccct 240 
caggccaagc gcatcagtga ggccgcccag cgcgccggta tatacgtcgc gctagggtac 300 
agcgaacgcg tgagcggaac cctctacatg gggcagtggc tcattgacga taagggcgaa 360 
acagctgggc tgcgccgaaa gctgaaacca acccatgtag agcgaaccct cttcggtgaa 420 
ggcgacggat catccctttc cactttcgac acaccgttgg gggtgctggg cggactctgc 480 
tqttgggaac acttacaacc tctttcgaaa tatgcgctct acgcacagaa cgaggaaata 540 
cacttcgccg cctggcctag cttcagcatc taccgtcaag cgacagaagt ccttggacca 600 
gaagtaaatg tcgcagcttc tcggatctac gccgtggaag ggcagtgttt tgttctcgct 660 
tcctgcgcgc tcgtctcgcc agagatgatc gaaatgctct gcactgacga aagcaagcac 
agccttcttc aggccggcgg cgggtactcc cgcattatcg gtcccgatgg cagcgaccta 
gcgcgcccct tgggcgaaaa cgaggaaggt attctctatg ccactctgga ccctgccgct 
cgaatctatg caaagaccgc agctgatcca gccgggcact actccagacc agacgtcact 
cggctgctga tcaatcgcag tgccaatcag ccagtcgtag aggttggacg ggaaatacct 
gcatcggccc aaggctttga agttgaggcg gcccccgggt acggaggcga ttga 

<210> 58 
<211> 337 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 

Vai°Lys 8 Glu Ala He Lys Val Ala Cys Val Gin Ala Ala Pro Val Phe 

1 5 10 15 

Leu Asp Leu Asp Ala Thr Val Asp Lys Thr Val Ala Leu He Glu Glu 

20 25 30 

Ala Ala Arg Asn Gly Ala Arg Leu lie Ala Phe Pro Glu Thr Trp He 

35 40 45 

Pro Gly Tyr Pro Trp Phe Leu Trp Leu Asp Ser Pro Ala Trp Gly Met 

50 55 6° 

Gin Phe Val Arg Arg Tyr His Glu Asn Ser Leu Val Leu Asp Ser Pro 
65 70 75 80 

Gin Ala Lys Arg He Ser Glu Ala Ala Gin Arg Ala Gly He Tyr Val 

85 90 95 

Ala Leu Gly Tyr Ser Glu Arg Val Ser Gly Thr Leu Tyr Met Gly Gin 



720 
780 
840 
900 
960 
1014 
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100 105 
Trp Leu lie Asp Asp Lys Gly Glu Thr Ala 

115 120 
Lys Pro Thr His Val Glu Arg Thr Leu Phe 

130 135 
Ser Leu Ser Thr Phe Asp Thr Pro Leu Gly 
145 150 
Cys Trp Glu His Leu Gin Pro Leu Ser Lys 
165 170 
Asn Glu Glu lie His Phe Ala Ala Trp Pro 

180 185 
Gin Ala Thr Glu Val Leu Gly Pro Glu Val 

195 200 
lie Tyr Ala Val Glu Gly Gin Cys Phe Val 

210 215 
Val Ser Pro Glu Met lie Glu Met Leu Cys 
225 230 
Ser Leu Leu Gin Ala Gly Gly Gly Tyr Ser 
245 250 
Gly Ser Asp Leu Ala Arg Pro Leu Gly Glu 

260 265 
Tyr Ala Thr Leu Asp Pro Ala Ala Arg lie 

275 280 
Asp Pro Ala Gly His Tyr Ser Arg Pro Asp 

290 295 
Asn Arg Ser Ala Asn Gin Pro Val Val Glu 
305 310 
Ala Ser Ala Gin Gly Phe Glu Val Glu Ala 
325 330 

Asp 



110 

Gly. Leu Arg Arg Lys Leu 
125 

Gly Glu Gly Asp Gly Ser 
140 

Val Leu Gly Gly Leu Cys 
155 160 
Tyr Ala Leu Tyr Ala Gin 
175 

Ser Phe Ser lie Tyr Arg 
190 

Asn Val Ala Ala Ser Arg 
205 

Leu Ala Ser Cys Ala Leu 
220 

Thr Asp Glu Ser Lys His 
235 240 
Arg He He Gly Pro Asp 
255 

Asn Glu Glu Gly He Leu 
270 

Tyr Ala Lys Thr Ala Ala 
285 

Val Thr Arg Leu Leu He 
300 

Val Gly Arg Glu He Pro 
315 320 
Ala Pro Gly Tyr Gly Gly 
335 



<210> 59 
<211> 987 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 59 

atgcgagata ggaatttcaa actggcggcc attcaggcgg agccggtttt ctttaatcgc 60 

cgggcctcga cggaaaaggc ctgcagattg atcaaagaag cgggcgcgat gggcgccgat 120 

atcgcgggat tcagcgagac ctggcttccc gggtatccct tttttatctg gggcgcaagc 180 

gccgatccat ccctgctctg gaaggcttct gcggaatacc tggccaatgc cgttcaaata 240 

cccggtcccg agacggatca attatgcgag gcggcgaaaa aggccggcat cgatgtggcg 300 

atcggagtgg ttgaactcga cgagtttacg aagggaacgg cttactgcac gctgctcttc 360 

atcggcaaag aagggaagat cctgggaaag caccgcaaac tcaagccgac gcaccgggag 420 

cgcacggtat ggggagaggg cgatgcgacg ggactcagtg tccatgagcg tccttacggg 480 

cggatcagcg gcctgaactg ctgggagcat aatatggtcc tgcccggcta tgtcctgatg 540 

tctcagggca cgcacattca tatcgcggcc tggccgggtt cggaagggaa agcacctccc 600 

gcgccgtctc cgatgtggga gcgccagctt ctgctctccc gcgctttcgc ttcgcaatcc 660 

gccgcatacg tgattctggt cggaggactc ctgaacccgc agaatattcc ggcgccctac 720 

gatgaacttg ccgtcaagta ccggggagac agtttcatca tcgatccgcg cggggagatc 780 

atcgccgggc cggccaaggg ggaaaccatt ctcatcgccg aaggctcgat ggaacaggtc 840 

ctcgcggcaa agtccgcctt cgatgtcgcg ggacattatt cccgccccga cgtctttcaa 900 

ctctgcgtca accgcaaacc gtaccggcgt gtaagggaaa cttcggagca ggaccaaccc 960 

gcttctgaaa gagaatcgga atcgtaa 987 

<210> 60 
<211> 328 
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<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 60 

Met Arg Asp Arg Asn Phe Lys Leu Ala Ala He Gin Ala Glu Pro Val 

15 10 15 

Phe Phe Asn Arg Arg Ala Ser Thr Glu Lys Ala Cys Arg Leu He Lys 

20 25 30 

Glu Ala Gly Ala Met Gly Ala Asp He Ala Gly Phe Ser Glu Thr Trp 

35 40 45 

Leu Pro Gly Tyr Pro Phe Phe He Trp Gly Ala Ser Ala Asp Pro Ser 

50 55 60 

Leu Leu Trp Lys Ala Ser Ala Glu Tyr Leu Ala Asn Ala Val Gin He 
65 70 75 80 

Pro Gly Pro Glu Thr Asp Gin Leu Cys Glu Ala Ala Lys Lys Ala Gly 

85 90 95 

He Asp Val Ala He Gly Val Val Glu Leu Asp Glu Phe Thr Lys Gly 

100 105 110 

Thr Ala Tyr Cys Thr Leu Leu Phe He Gly Lys Glu Gly Lys He Leu 

115 120 125 

Gly Lys His Arg Lys Leu Lys Pro Thr His Arg Glu Arg Thr Val Trp 

130 135 140 

Gly Glu Gly Asp Ala Thr Gly Leu Ser Val His Glu Arg Pro Tyr Gly 
145 150 155 160 

Arg He Ser Gly Leu Asn Cys Trp Glu His Asn Met Val Leu Pro Gly 

165 170 175 

Tyr Val Leu Met Ser Gin Gly Thr His He His He Ala Ala Trp Pro 

180 185 190 

Gly Ser Glu Gly Lys Ala Pro Pro Ala Pro Ser Pro Met Trp Glu Arg 

195 200 205 

Gin Leu Leu Leu Ser Arg Ala Phe Ala Ser Gin Ser Ala Ala Tyr Val 

210 215 220 

He Leu Val Gly Gly Leu Leu Asn Pro Gin Asn He Pro Ala Pro Tyr 
225 230 235 240 

Asp Glu Leu Ala Val Lys Tyr Arg Gly Asp Ser Phe He He Asp Pro 

245 250 255 

Arg Gly Glu He He Ala Gly Pro Ala Lys Gly Glu Thr He Leu He 

260 265 270 

Ala Glu Gly Ser Met Glu Gin Val Leu Ala Ala Lys Ser Ala Phe Asp 

275 280 285 

Val Ala Gly His Tyr Ser Arg Pro Asp Val Phe Gin Leu Cys Val Asn 

290 295 300 

Arg Lys Pro Tyr Arg Arg Val Arg Glu Thr Ser Glu Gin Asp Gin Pro 
305 310 315 320 

Ala Ser Glu Arg Glu Ser Glu Ser 
325 

<210> 61 
<211> 966 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 61 

atgactcgat cttacccgaa tgacacactc acggttgggc ttgcgcaaat tgctccagtc 60 
tggttggatc gtacagggac aatttcaaag atattagctc aagtccatgc ggcaaatgaa 120 
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gcgggctgtc atcttgtcgc gtttggcgaa ggtctcctcc ccggatatcc gttttggatt 180 

gagcgaacaa atggtgcagt cttcaactcg cccacgcaga aagaaattca cgcgcattat 240 

ctggatcagg ctgtccagat cgaagcaggt catcttgagg cgctttgcga agcagccaag 300 

gaatatgaga tcgcaattgt cctgggatgc attgaacgtc cgcaagatcg tggagggcac 360 

agtctgtatg caagccttgt atatattgat tcagacggca tcatccaatc tgtgcatcga 420 

aagttaatgc caacatatga agaacggctc acctggtcgc caggtgacgg acatggatta 480 

cgggtgcaca aattaggtgc ctttacggtt ggcggcctca actgttggga aaactggatg 540 

cctttggcac gcgcggccat gtatggtcaa ggcgaggatt tgcatattgc catttggccc 600 

ggcggctccc acaatacgca agacattaca cgctttattg cactagaatc gcgttcctat 660 

gttttatctg tgtcaggttt aatgcgctca ggcgattttc caaaagagac cccacatctt 720 

gcatccatcc tggctaaagg tgaggatatt cttgccaacg gtggttcatg tatcgccggt 780 

cctgacggca aatggatcgt tgagccgctt gtaggagaag agaagttaat tgttgcaacg 840 

attgatcatt gtcgtgtgcg cgaagagcgt caaaattttg atccttcagg acattacagc 900 

aggccagatg tattgcaact gaaaataaac aggcaacgcc agagtacaat ctcgtttgga 960 

. 966 
gagtaa 

<210> 62 
<211> 321 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 

<400> 62 , 

Met Thr Arg Ser Tyr Pro Asn Asp Thr Leu Thr Val Gly Leu Ala Gin 

15 10 15 

lie Ala Pro Val Trp Leu Asp Arg Thr Gly Thr He Ser Lys He Leu 

20 25 30 

Ala Gin Val His Ala Ala Asn Glu Ala Gly Cys His Leu Val Ala Phe 

35 40 45 

Gly Glu Gly Leu Leu Pro Gly Tyr Pro Phe Trp He Glu Arg Thr Asn 

50 55 60 

Glv Ala Val Phe Asn Ser Pro Thr Gin Lys Glu He His Ala His Tyr 
65 70 75 80 

Leu Asp Gin Ala Val Gin He Glu Ala Gly His Leu Glu Ala Leu Cys 

85 90 95 

Glu Ala Ala Lys Glu Tyr Glu He Ala He Val Leu Gly Cys He Glu 

100 105 HO 

Arg Pro Gin Asp Arg Gly Gly His Ser Leu Tyr Ala Ser Leu Val Tyr 

115 120 125 

He Asp Ser Asp Gly He He Gin Ser Val His Arg Lys Leu Met Pro 

130 135 140 

Thr Tyr Glu Glu Arg Leu Thr Trp Ser Pro Gly Asp Gly His Gly Leu 
145 150 155 160 

Arq Val His Lys Leu Gly Ala Phe Thr Val Gly Gly Leu Asn Cys Trp 

165 170 175 

Glu Asn Trp Met Pro Leu Ala Arg Ala Ala Met Tyr Gly Gin Gly Glu 

180 185 190 

Asp Leu His He Ala He Trp Pro Gly Gly Ser His Asn Thr Gin Asp 

195 200 205 

He Thr Arg Phe He Ala Leu Glu Ser Arg Ser Tyr Val Leu Ser Val 

210 215 220 

Ser Gly Leu Met Arg Ser Gly Asp Phe Pro Lys Glu Thr Pro His Leu 
225 230 235 240 

Ala Ser He Leu Ala Lys Gly Glu Asp He Leu Ala Asn Gly Gly Ser 

245 250 255 

Cys lie Ala Gly Pro Asp Gly Lys Trp He Val Glu Pro Leu Val Gly 

260 265 270 

Glu Glu Lys Leu He Val Ala Thr He Asp His Cys Arg Val Arg Glu 
275 280 285 
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Glu Arg Gin Asn Phe Asp Pro Ser Gly His Tyr Ser Arg Pro Asp Val 

290 295 300 

Leu Gin Leu Lys He Asn Arg Gin Arg Gin Ser Thr He Ser Phe Gly 
305 310 315 320 

Glu 



<210> 63 
<211> 978 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 63 

atgcaagata 

gattgtgacg 

gcaaacctga 

ggtcgaggtc 

aacaattcgg 

gctcgagcca 

aacaccctgc 

cccaccaatc 

gatacaccct 

cgttatgcac 

gagatcactc 

gtgtgcatga 

gcggaggcag 

gtgctggcgg 

cgaattgtgg 

ctacgactgc 

gatccgtctg 



gagtaccgat 
cgaccgtgga 
tcgtgtttcc 
tggccacttt 
tggaaatccc 
ccgttgtcat 
tatttattgg 
acgaacggat 
gtggaaaagt 
tctatggaca 
tggtcaatgc 
tccttcgcaa 
atgacttcat 
gtccattgtg 
atgagagacg 
actttaatgc 
agggttaa 



tgtacgagct 
aaaagcctgc 
cgaagccttc 
tggcggacag 
tggtccggcc 
gggattgaat 
gccagacggt 
gatctggggt 
aggcggtctc 
gggcgaacaa 
acgcaatacc 
gtccagcttt 
aaaatcaggc 
gaatgaagag 
agtatttgat 
ttcccctcag 



gcggctatcc 
cgattgatcg 
attcccgttt 
cgccagaaat 
accgacaggc 
gagcgcgcgg 
cgcttgctgg 
atgggagatg 
atctgctggg 
atccatgtcg 
gcctatgagg 
ccccatgatt 
ggcagcgcga 
aatatactgt 
gtgacgggcc 
aaaactattg 



aggctgaacc 
gtgaagcagc 
atcccaatgc 
acgtatggac 
tggcaaaggc 
tcgataacaa 
gcaagcaccg 
ggagcaccct 
aaaactacat 
cgccgactgc 
gacgcttatt 
ttgagctggg 
tcgttgggcc 
atgccgatct 
attattcacg 
aaagatatga 



catagtcctt 
agaaaatggt 
ggcgatctgg 
gcgactatgg 
agcacacgag 
cacgctttac 
taagctcatg 
gcgggttttt 
gcctctggcg 
gcacgatggt 
cgtcatctcc 
cgaggaattg 
agatggcgag 
tgacttgaat 
tccagatgtt 
gcaacctctc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
978 



<210> 64 
<211> 325 
<212> PRT 
<213> Unknown 

<220> 1 

<223> Obtained from an environmental sample 



<400> 64 












Met 


Gin Asp 


Arg Val Pro He 


Val 


Arg Ala 


Ala Ala He Gin Ala 


Glu 


1 


5 




10 


15 




Pro 


He Val 


Leu Asp Cys Asp 


Ala 


Thr Val 


Glu Lys Ala Cys Arg 


Leu 






20 




25 


30 




He 


Gly Glu 


Ala Ala Glu Asn 


Gly 


Ala Asn 


Leu He Val Phe Pro- 


Glu 




35 




40 




45 




Ala 


Phe He 


Pro Val Tyr Pro 


Asn 


Ala Ala 


He Trp Gly Arg Gly Leu 




50 


55 






60 




Ala 


Thr Phe 


Gly Gly Gin Arg 


Gin 


Lys Tyr 


Val Trp Thr Arg Leu Trp 


65 




70 






75 


80 


Asn 


Asn Ser 


Val Glu He Pro 


Gly 


Pro Ala 


Thr Asp Arg Leu Ala 


Lys 






85 




90 


95 




Ala 


Ala His 


Glu Ala Arg Ala 


Thr 


Val Val 


Met Gly Leu Asn Glu 


Arg 






100 




105 


110 




Ala 


Val Asp 


Asn Asn Thr Leu 


Tyr 


Asn Thr 


Leu Leu Phe He Gly 


Pro 




115 




120 




125 




Asp 


Gly Arg 


Leu Leu Gly Lys 


His 


Arg Lys 


Leu Met Pro Thr Asn 


His 




130 


135 






140 
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Glu Arq 


Met 


lie Trp Gly Met 


Gly Asp Gly Ser Thr Leu Arg 


Val 


Phe 


145 






150 


155 




160 


Asp 


Thr 


Pro 


Cys Gly Lys Val 


Gly Gly Leu He Cys Trp Glu 


Asn 


Tyr 






165 


170 


175 




Met 


Pro 


Leu 


Ala Arg Tyr Ala 


Leu Tyr Gly Gin Gly Glu Gin 


He 


His 








180 


185 190 






Val 


Ala 


Pro 


Thr Ala His Asp 


Gly Glu He Thr Leu Val Asn 


Ala 


Arg 






195 




200 205 






Asn 


Thr 


Ala 


Tyr Glu Gly Arg 


Leu Phe Val He Ser Val Cys 


Met 


He 




210 




215 


220 






Leu Arq 


Lys 


Ser Ser Phe Pro 


His Asp Phe Glu Leu Gly Glu 


Glu 


Leu 


225 




230 


235 




240 


Ala 


Glu 


Ala 


Asp Asp Phe He 


Lys Ser Gly Gly Ser Ala He 


Val 


Gly 








245 


250 


255 




Pro 


Asp 


Gly 


Glu Val Leu Ala 


Gly Pro Leu Trp Asn Glu Glu 


Asn 


He 




260 


265 270 






Leu 


Tyr 


Ala 


Asp Leu Asp Leu 


Asn Arg He Val Asp Glu Arg Arg Val 




275 




280 285 




His 


Phe 


Asp 


Val 


Thr Gly His Tyr 


Ser Arg Pro Asp Val Leu Arg 


Leu 




290 




295 


300 






Phe 


Asn 


Ala 


Ser Pro Gin Lys 


Thr He Glu Arg Tyr Glu Gin 


Pro 


Leu 


305 






310 


315 




320 


Asp 


Pro 


Ser 


Glu Gly 









325 



<210> 65 
<211> 1002 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 65 

atgccgaccc ccacttcaaa attcaaaatc ggcgccgtgc aggcatcgcc ggtttttctg 60 

gaccgggaag ccactgcgca aaaagcctgc aaattgattg ccgaagcggg agggcagggc 120 

gcgcggctga tcgttttccc ggagtctttc attcccacct atcctgattg ggtctgggcg 180 

gtcccgccgg gaagaggaaa agtgttaagc gaactttacg ccgagctgct ggccaatgcc 240 

gtggaagtcc ccgggccggt caccgatcag ctgggtgaag cagcccaaaa aacgggcgcc 300 

tatgtcgtca tgggcgtcac ggaaaaggac accgacgcaa gcggcgcgag cctttacaac 360 

acgctcctct atttcaaccc cgcgggggac ctcctgggaa aacaccggaa gcttgttcct 420 

accggcgggg agcggctggt ctgggcgcag ggcgacggca gcaccctgga agtgtacgac 480 

actcccctgg gaaaaatcgg aggcctcatc tgctgggaaa actacatgcc cctcgcccgg 540 

tacacgatgt atgcctgggg gacccagatt tatatcgcgg ccacatggga ccagggggag 600 

acgtggcttg ccaccctgcg gcatatcgct aaggaaggac gggtgtacgt catcggctgc 660 

tgcatcgcgc tgcggcggga cgacatcccc gaccggctgg aatacaagaa gaagttctac 720 

tcggggtcgc gggaatggat caatatgggg gacagcgcca tcgtgaaccc ggaaggcgaa 780 

ttcattgccg gccccgtgcg gatgaaggag gagatcctgt atgccgaggt ggaccccctc 840 

ctgatggcgg gatcgaaatg gatgctcgac gtcgcggggc attacgcgcg ccccgacgtc 900 

tttgaactca tcgtccaccg ccagccccac ccgatgatcc gggtaatcga gaaagaggga 960 

ggggccggaa gaaccgggga cgagaagaag gaaaatgagt ga 1002 

<210> 66 
<211> 333 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 66 
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Met Pro Thr Pro Thr Ser Lys Phe Lys lie Gly Ala Val Gin Ala Ser 

15 10 15 

Pro Val Phe Leu Asp Arg Glu Ala Thr Ala Gin Lys Ala Cys Lys Leu 

20 25 30 

He Ala Glu Ala Gly Gly Gin Gly Ala Arg Leu He Val Phe Pro Glu 

35 40 45 

Ser Phe He Pro Thr Tyr Pro Asp Trp Val Trp Ala Val Pro Pro Gly 

50 55 60 

Arg Gly Lys Val Leu Ser Glu Leu Tyr Ala Glu Leu Leu Ala Asn Ala 
65 70 75 80 

Val Glu Val Pro Gly Pro Val Thr Asp Gin Leu Gly Glu Ala Ala Gin 

85 90 95 

Lys Thr Gly Ala Tyr Val Val Met Gly Val Thr Glu Lys Asp Thr 'Asp 

100 105 110 

Ala Ser Gly Ala Ser Leu Tyr Asn Thr Leu Leu Tyr Phe Asn Pro Ala 

115 120 125 

Gly Asp Leu Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly Gly Glu 

130 135 140 

Arg Leu Val Trp Ala Gin Gly Asp Gly Ser Thr Leu Glu Val Tyr Asp 
145 150 155 160 

Thr Pro Leu Gly Lys He Gly Gly Leu He Cys Trp Glu Asn Tyr Met 

165 170 175 

Pro Leu Ala Arg Tyr Thr Met Tyr Ala Trp Gly Thr Gin He Tyr He 

180 185 190 

Ala Ala Thr Trp Asp Gin Gly Glu Thr Trp Leu Ala Thr Leu Arg His 

195 200 205 

He Ala Lys Glu Gly Arg Val Tyr Val He Gly Cys Cys He Ala Leu 

210 215 220 

Arg Arg Asp Asp He Pro Asp Arg Leu Glu Tyr Lys Lys Lys Phe Tyr 
225 ' 230 235 240 

Ser Gly Ser Arg Glu Trp He Asn Met Gly Asp Ser Ala He Val Asn 

245 250 255 

Pro Glu Gly Glu Phe lie Ala Gly Pro Val Arg Met Lys Glu Glu He 

260 265 270 

Leu Tyr Ala Glu Val Asp Pro Leu Leu Met Ala Gly Ser Lys Trp Met 

275 280 285 

Leu Asp Val Ala Gly His Tyr Ala Arg Prp Asp Val Phe Glu Leu He 

290 295 300 

Val His Arg Gin Pro His Pro Met He Arg Val He Glu Lys Glu Gly 
305 310 315 320 

Gly Ala Gly Arg Thr Gly Asp Glu Lys Lys Glu Asn Glu 
325 330 

<210> 67 
<211> 936 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 67 

atgccccgtg tggcggtggt ccagcgcccg ccggtgtttc tcgaccgcgc cgcgaccctc 60 

gagaacgccg tggcttcgct cgccgaggcc gcgtcgaacg gggctcgcct cgcggtcttt 120 

ccggaagccc tggttcccgg ctatccggcg tggatgtggc ggctgcggcc cgggcccgac 180 

atggcgctca ccgagcggat tcacgcgcgc ttgcgggcga actcggtgag cctcgccgcc 240 

gacgagctcg cgccgctgcg cgaggcggcc cggcgccacg agctcaccgt agtgtgcggc 300 

ctgcacgagc gcgacgaggc gctcggcggc ggcacgctct ataacaccgt cgtcacgatc 360 

ggcgccgacg gcgcggtgct caaccgccac cggaagctga tgcccaccaa ccccgagcgc 420 

atggtctggg gctgcggcga tgccagcggg ctcaggacgg tccccaccca gtgcgggcgc 480 

gtcggcgccc tgatctgctg ggaaagctac atgccgcttg cacgctacgc gctgtacgcc 540 
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840 
900 



cagggaatcg acctctacgt cacgccgacc tacgacagcg gcgagcgggc ggttgcgacc 600 

atgcagcaca ttgcccgcga aggcggctgc tgggtggtga gctgcggctc ggcgtttcag 660 

gcgcgcgacg tcccggacgc gtttccgggg aagagcgagc ttttccgcga caacgacgag 720 

tggatcaacc cgggcgactc ggtcgtggtc gcgccgggcg gcaaggtcgt cgccgggccg 780 
ctgcacaaag aacgcgcgat cctgtacgcc gagatcgacc tcgagcgggt cggcgtggcg 
cgccgcagcc tggacgtggt cggccattat gcgcggcccg acctcttcga cctgcacgtg 

aacgcccgcc cgcaaagcgt ggttgaattg cgctga 936 

<210> 68 
<2U> 311 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 68 

Met Pro Arg Val Ala Val Val Gin Arg Pro Pro Val Phe Leu Asp Arg 

15 10 15 

Ala Ala Thr Leu Glu Asn Ala Val Ala Ser Leu Ala Glu Ala Ala Ser 

20 25 30 

Asn Gly Ala Arg Leu Ala Val Phe Pro Glu Ala Leu Val Pro Gly Tyr 

35 40 45 

Pro Ala Trp Met Trp Arg Leu Arg Pro Gly Pro Asp Met Ala Leu Thr 

50 55 60 

Glu Arg He His Ala Arg Leu Arg Ala Asn Ser Val Ser Leu Ala Ala 
65 70 75 80 

Asp Glu Leu Ala Pro Leu Arg Glu Ala Ala Arg Arg His Glu Leu Thr 

85 90 95 

Val Val Cys Gly Leu His Glu Arg Asp Glu Ala Leu Gly Gly Gly Thr 

100 105 110 

Leu Tyr Asn Thr Val Val Thr lie Gly Ala Asp Gly Ala Val Leu Asn 

115 120 125 

Arg His Arg Lys Leu Met Pro Thr Asn Pro Glu Arg Met Val Trp Gly 

130 135 140 

Cys Gly Asp Ala Ser Gly Leu Arg Thr Val Pro Thr Gin Cys Gly Arg 
145 150 155 160 

Val Gly Ala Leu He Cys Trp Glu Ser Tyr Met Pro Leu Ala Arg Tyr 

165 170 175 

Ala Leu Tyr Ala Gin Gly He Asp Leu Tyr Val Thr Pro Thr Tyr Asp 

180 185 190 

Ser Gly Glu Arg Ala Val Ala Thr Met Gin His He Ala Arg Glu Gly 

195 200 205 

Gly Cys Trp Val Val Ser Cys Gly Ser Ala Phe Gin Ala Arg Asp Val 

210 215 220 

Pro Asp Ala Phe Pro Gly Lys Ser Glu Leu Phe Arg Asp Asn Asp Glu 
225 230 235 240 

Trp He Asn Pro Gly Asp Ser Val Val Val Ala Pro Gly Gly Lys Val 

245 250 255 

Val Ala Gly Pro Leu His Lys Glu Arg Ala He Leu Tyr Ala Glu He 

260 265 270 

Asp Leu Glu Arg Val Gly Val Ala Arg Arg Ser Leu Asp Val Val Gly 

275 280 285 

His Tyr Ala Arg Pro Asp Leu Phe Asp Leu His Val Asn Ala Arg Pro 

290 295 300 

Gin Ser Val Val Glu Leu Arg 
305 310 

<210> 69 
<211> 939 
<212> DNA 
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<213> Unknown 
<220> 

<223> Obtained from an environmental sample 
<400> 69 

gtgaccgagt ttcggacggt gcgggtcgca gcggtgcagg cgacgccggt gaccctcgac 60 

gccgatgcct cggtcgagaa ggcgatcggg ctgatcggcg aggcggtggc cggtggagcg 120 

cagctcgtcg tgctgcccga ggccttcgtg tcgctctacc cgtcgaacgc gtgggcgcga 180 

gcggccgccg gattcggcgg cttcgacgag ctctgggagc ggatgtgggc cagctcgctc 240 

gacgtcccgg gcccgctggt cgaccggctg gtcgatgcgt gccgcaggca tgacgtggta 300 

tgcgtgatcg gcgtgaacga gcgcgaaagc gaaaggccgg ggtcgcttta caacacgatg 360 

ctgaccctcg gcccgtcggg cctcctgcac cggcaccgca agctcatgcc gacgcaccac 420 

gagcggctgt tccatgggat cggcgacggt caagacctcg gcgttgtgga gaccgacgcg 480 

ggacggatcg ggggactgat ctgctgggag aaccgaatgc cgctcgcgcg ctacgcggtc 540 

taccagggtg gaccgcagat ctgggtcgcg ccgacggccg atgactccga cggctggctc 600 

gcgagcatgc gccacatcgc gatcgagtcg ggcgcgttcg tcgtgtcggt gccgcagttc 660 

atcccggcgt ccgcgttccc cgacgatttc cccgtcgagc taccgccggg caaggaggtg 720 

ttcggccgcg gcggtgcggc gatcgtcgag ccgacctggg gcgaggtaat cgccgggccg 780 

ctctacgatc gggaggggat cgtgttcgcc gactgtgacc tgcgacgcgg cttgcatgcc 840 

aagcgctggt tcgactccgt cggccattac agccgcgcgg aggtgctcga tggcggcgtc 900 

gagcgcgtcc cggcgccggt ggacggcgaa tcgccgtga 939 

<210> 70 

<211> 312 

<212> PRT 

<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 70 

Val Thr Glu Phe Arg Thr Val Arg Val Ala Ala Val Gin Ala Thr Pro 

15 10 15 

Val Thr Leu Asp Ala Asp Ala Ser Val Glu Lys Ala He Gly Leu lie 

20 25 30 

Gly Glu Ala Val Ala Gly Gly Ala Gin Leu Val Val Leu Pro Glu Ala 

35 40 45 

Phe Val Ser Leu Tyr Pro Ser Asn Ala Trp Ala Arg Ala Ala Ala Gly 

50 55 60 

Phe Gly Gly Phe Asp Glu Leu Trp Glu Arg Met Trp Ala Ser Ser Leu 
65 70 75 80 

Asp Val Pro Gly Pro Leu Val Asp Arg Leu Val Asp Ala Cys Arg Arg 

85 90 95 

His Asp Val Val Cys Val He Gly Val Asn Glu Arg Glu Ser Glu Arg 

100 105 HO 

Pro Gly Ser Leu Tyr Asn Thr Met Leu Thr Leu Gly Pro Ser Gly Leu 

115 120 125 

Leu His Arg His Arg Lys Leu Met Pro Thr His His Glu Arg Leu Phe 

130 135 140 

His Gly He Gly Asp Gly Gin Asp Leu Gly Val Val Glu Thr Asp Ala 
145 150 155 160 

Gly Arg He Gly Gly Leu He Cys Trp Glu Asn Arg Met Pro Leu Ala 

165 170 175 

Arq Tyr Ala Val Tyr Gin Gly Gly Pro Gin He Trp Val Ala Pro Thr 

180 185 190 

Ala Asp Asp Ser Asp Gly Trp Leu Ala Ser Met Arg His lie Ala He 

195 200 205 

Glu Ser Gly Ala Phe Val Val Ser Val Pro Gin Phe He Pro Ala Ser 

210 215 220 

Ala Phe Pro Asp Asp Phe Pro Val Glu Leu Pro Pro Gly Lys Glu Val 
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225 230 235 240 

Phe Gly Arg Gly Gly Ala Ala He Val Glu Pro Thr Trp Gly Glu Val 

245 250 255 

He Ala Gly Pro Leu Tyr Asp Arg Glu Gly He Val Phe Ala Asp Cys 

260 265 270 

Asp Leu Arg Arg Gly Leu His Ala Lys Arg Trp Phe Asp Ser Val Gly 

275 280 285 

His Tyr Ser Arg Ala Glu Val Leu Asp Gly Gly Val Glu Arg Val Pro 

290 295 300 

Ala Pro Val Asp Gly Glu Ser Pro 
305 310 

<210> 71 
<211> 966 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 71 

atgccaaacg agaacaccaa cgccacattc aaagttgccg ctgtgcaggc ttcgcctgtg 60 

tttcttgatc gtgccgcaac aatcgacaag gcttgcgatt tgatcgccgc tgctggcggt 120 

gaaggggcac gcttgattgt ctttccagaa gcattcatcc cgtcttatcc tgattgggta 180 

tgggcaattc cttcgggtga agagggcgta ctcaatgagt tgtacgcaga tctgctatcc 240 

aactcggtca cgattcccag tgactcgacg gacaaactgt gcagagcagc caggcttgct 300 

aatgcctacg tggtgatggg tatgagcgaa cgcaatgctg aggcaagcgg cgcgagcatg 360 

tataacacgc tattgtatat tgatgcacag ggggagattc tgggcaagca tcggaagttg 420 

gtgccaacgg gcggcgagcg gctagtctgg gcgcagggcg atggcagtac actgcaggtc 480 

tatgatactc ccttagggaa actcggtggc ttaatttgct gggagaatta tatgccactg 540 

gcccgctata ccatgtatgc ctggggcaca caaatctatg tcgcggcaac gtgggatcgg 600 

ggtcagccct ggctctctac tttacgccac attgccaaag aaggcagggt gtatgtgatt 660 

ggttgttgta tcgcgatgcg taaagacgat atcccagacc attatacaat gaaacagaag 720 

ttttactcag atgcagatga gtggattaat attggcgata gtgcgattgt taatcccgaa 780 

gggcaattta tcgctggacc ggtgcgcaag caggaagaga ttctctatgc ggagattgat 840 

ccgcgcatgg tccaagggcc gaagtggatg ctcgacgtgg cgggacatta tgccaggccg 900 

gatgtgttcg aactgattgt ccacacggat attcgaagga tgatcaaatc ggaaaagaat 960 

tcataa 966 

<210> 72 
<211> 321 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 72 



Met 


Pro 


Asn 


Glu 


Asn 


Thr 


Asn 


Ala 


Thr 


Phe 


Lys Val Ala Ala Val 


Gin 


1 








5 










10 


15 




Ala 


Ser 


Pro 


Val 


Phe 


Leu 


Asp 


Arg 


Ala 


Ala 


Thr He Asp Lys Ala 


Cys 








20 










25 




30 




Asp 


Leu 


He 


Ala 


Ala 


Ala 


Gly 


Gly 


Glu 


Gly 


Ala Arg Leu He Val 


Phe 




35 










40 






45 




Pro 


Glu 


Ala 


Phe 


He 


Pro 


Ser 


Tyr 


Pro 


Asp 


Trp Val Trp Ala He 


Pro 




50 










55 








60 




Ser 


Gly 


Glu 


Glu 


Gly 


Val 


Leu 


Asn 


Glu 


Leu 


Tyr Ala Asp Leu Leu 


Ser 


65 








70 










75 


80 


Asn 


Ser 


Val 


Thr 


He 


Pro 


Ser 


Asp 


Ser 


Thr 


Asp Lys Leu Cys Arg 


Ala 










85 










90 


95 




Ala 


Arg 


Leu 


Ala 


Asn 


Ala 


Tyr 


Val 


Val 


Met 


Gly Met Ser Glu Arg 


Asn 
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100 105 HO 

Ala Glu Ala Ser Gly Ala Ser Met Tyr Asn Thr Leu Leu Tyr lie Asp 

115 120 125 

Ala Gin Gly Glu lie Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly 

130 135 140 

Gly Glu Arg Leu Val Trp Ala Gin Gly Asp Gly Ser Thr Leu Gin Val 
145 150 155 160 

Tvr Asp Thr Pro Leu Gly Lys Leu Gly Gly Leu He Cys Trp Glu Asn 

165 170 175 

Tyr Met Pro Leu Ala Arg Tyr Thr Met Tyr Ala Trp Gly Thr Gin He 

180 185 190 

Tyr Val Ala Ala Thr Trp Asp Arg Gly Gin Pro Trp Leu Ser Thr Leu 

195 200 205 

Arg His He Ala Lys Glu Gly Arg Val Tyr Val He Gly Cys Cys He 

210 215 220 

Ala Met Arg Lys Asp Asp He Pro Asp His Tyr Thr Met Lys Gin Lys 
225 230 235 240 

Phe Tyr Ser Asp Ala Asp Glu Trp He Asn He Gly Asp Ser Ala He 

245 250 255 

Val Asn Pro Glu Gly Gin Phe He Ala Gly Pro Val Arg Lys Gin Glu 

260 265 270 

Glu He Leu Tyr Ala Glu He Asp Pro Arg Met Val Gin Gly Pro Lys 

275 280 285 

Trp Met Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe Glu 

290 295 300 

Leu He Val His Thr Asp He Arg Arg Met He Lys Ser Glu Lys Asn 
305 310 315 320 

Ser 



<210> 73 
<211> 1035 
<212> DNA 
<213> Unknown 

<220> 

<223> -Obtained from an environmental sample 



60 



<400> 73 fc . ^ , 

atgacagcaa tagactcaac gtttaaagtc gccgccgttc aggctgcgcc ggtcttcctc 

aatcgcgacg caaccgtgga gaaggcgtgc cggctgatca agtccgcggc agagggaggc 120 

gcgcgtctga tcgttttccc ggaagcgttc ataccggcct acccggactg ggtgtggacg 180 

gtccctgccg gtgagcaagg cctgctcaac gacctctacg gccaactcgt cgaccagtcc 240 

gtgacgattc ccagcgacat caccaccgag ttatgtaacg cggcacgggc agcaaacgcc 300 

tatgtcgtga ttggtgtcaa cgagcgcaac gcggaggcaa gcaatggaag cctctacaac 360 

tcgct octet acatcgacgc aaacggcaaa attcteggta agcaccgcaa gctcgttccc 420 

acaggeggag aacggctcgt gtgggcgcag ggcgatggca gcacgctcga agcctacgac 480 

aeggagctgg geaaactegg eggtctcatt tgctgggaga actatatgee gctggcacgc 540 

tacgegatgt acgcatgggg agtgcagctc tatgtcgccg cgacctggga ccgtggcggc 600 

ccctggactg ccacgctgcg tcatgtcgcc aaggaaggtc agatgtacgt catcgggtgc 660 

tgccaggccc tgcacaagga tgacctgccg gagctagacg ggctgaagga gaagtactac 720 

gccaacgcac gagagtggat caatgttggc gaeagegcta ttgtcggccc ggacggacaa 780 

ttccttgtcg agcccgtccg aatgcgggaa gacatcctct aegecgaggt ggacactcgc 840 

aacttccgcg geccgaagtg gatgttcgac gcggctggac actacgcgcg tcccgacatt 900 

ttccaactca cagtgaaccg egagcagegg ccgatggtcc gegtegtegg tgacagcagt 960 

gaccagaagg agcggccgct cccggacgac ggaeggctet ggtacgecta cagcaccaat 1020 
cagcaccacg actga 



<210> 74 
<211> 344 
<212> PRT 
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<213> Unknown 
<220> 

<223> Obtained from an environmental sample 
<400> 74 

Met Thr Ala He Asp Ser Thr Phe Lys Val Ala Ala Val Gin Ala Ala 

15 10 15 

Pro Val Phe Leu Asn Arg Asp Ala Thr Val Glu Lys Ala Cys Arg Leu 

20 25 30 

lie Lys Ser Ala Ala Glu Gly Gly Ala Arg Leu He Val Phe Pro Glu 

35 40 45 

Ala Phe He Pro Ala Tyr Pro Asp Trp Val Trp Thr Val Pro Ala Gly 

50 55 60 

Glu Gin Gly Leu Leu Asn Asp Leu Tyr Gly Gin Leu Val Asp Gin Ser 
65 70 75 80 

Val Thr He Pro Ser Asp He Thr Thr Glu Leu Cys Asn Ala Ala Arg 

85 90 95 

Ala Ala Asn Ala Tyr Val Val He Gly Val Asn Glu Arg Asn Ala Glu 

100 105 110 

Ala Ser Asn Gly Ser Leu Tyr Asn Ser Leu Leu Tyr He Asp Ala Asn 

115 120 125 

Gly Lys He Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly Gly Glu 

130 135 140 

Arg Leu Val Trp Ala Gin Gly Asp Gly Ser Thr Leu Glu Ala Tyr Asp 
145 150 155 160 

Thr Glu Leu Gly Lys Leu Gly Gly Leu He Cys Trp Glu Asn Tyr Met 

165 170 175 

Pro Leu Ala Arg Tyr Ala Met Tyr Ala Trp Gly Val Gin Leu Tyr Val 

180 185 190 

Ala Ala Thr Trp Asp Arg Gly Gly Pro Trp Thr Ala Thr Leu Arg His 

195 200 205 

Val Ala Lys Glu Gly Gin Met Tyr Val He Gly Cys Cys Gin Ala Leu 

210 215 220 

His Lys Asp Asp Leu Pro Glu Leu Asp Gly Leu Lys Glu Lys Tyr Tyr 
225 230 235 240 

Ala Asn Ala Arg Glu Trp He Asn Val Gly Asp Ser Ala He Val Gly 

245 250 255 

Pro Asp Gly Gin Phe Leu Val Glu Pro Val Arg Met Arg Glu Asp He 

260 265 270 

Leu Tyr Ala Glu Val Asp Thr Arg Asn Phe Arg Gly Pro Lys Trp Met 

275 280 285 

Phe Asp Ala Ala Gly His Tyr Ala Arg Pro Asp He Phe Gin Leu Thr 

290 295 300 

Val Asn Arg Glu Gin Arg Pro Met Val Arg Val Val Gly Asp Ser Ser 
305 310 315 320 

Asp Gin Lys Glu Arg Pro Leu Pro Asp Asp Gly Arg Leu Trp Tyr Ala 

325 330 335 

Tyr Ser Thr Asn Gin His His Asp 
340 

<210> 75 
<211> 1125 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 75 

atgagcacca ttgttaaagc cgctgcggtt caaatcagcc cagtcctcta cagccgcgag 



60 
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gggacagtcg caaaagttgt gcggaagatc cacgaacttg gccaaaaggg ggtgcggttc 120 

gccacgttcc cggagaccgt ggttccctac tatccatatt tttccgccgt ccagaccccc 180 

attcaactat tgtccggaac cgagtacctg aagttgctcg accaaggcgt gaccgtgccg 240 

tccacgacta ccgacgcaat cggggaggct gcccggaacg ccggcatggt tgtatctatc 300 

ggcgtgaatg agcgtgacgg cgggaccctg tacaacgcgc agttgctctt cgatgcggat 360 

gggaccttga ttcagcgtcg ccgcaagatc actcctacgc attacgagcg catgatctgg 420 

ggccagggag atggttcggg tttgcgggcc gtcaagagcc aggttggtcg tattggccaa 480 

cttgcatgct ttgagcacaa caacccactg gcgcgttacg cgatgatggc cgatggcgag 54 0 

caaatccatt cggccatgta tccaggttcc gcgttcggcg aggggttcgc ggaaaagatg 600 

gaaatcaata tccgccagca tgcgttggag tccgggtgct tcgttgtgaa tgcaacggcc 660 

tggcttgacg ccagccagca ggcacaaatc atgaatgaca cgggttgcca aatcggtccg 720 

atctcgggcg gttgctttac cacgatcgta acacccgacg gcacgtttct gggcgaacct 780 

ctccggtcgg gtgagggcga ggtcatcgcc gatctcgatt tcaagctgat cgacaaacgc 840 

aagatgttga tggactcgcg cggccactac agtcgcccgg aattgctcag tctgctgatc 900 

gaccgcaccc ccaccgcgca cattcatgag cgaggtgcgc cgcagacgtc aggcgctgtg 960 

caagaggcga cgaaagtggg ttcacacgcg ccgctcctgc gtgacggaca atgggatcag 1020 

ctcaatgcgg gagcgggccg acatacaggg aatggagaag cacagataga aatcatggcc 1080 

gcggcccact cgggcacccg tggaattgaa gcgaagggag cctaa 1125 

<210> 76 
<211> 374 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 

<400> 76 , 
Met Ser Thr lie Val Lys Ala Ala Ala Val Gin lie Ser Pro Val Leu 

! 5 10 15 

Tvr Ser Arg Glu Gly Thr Val Ala Lys Val Val Arg Lys lie His Glu 

20 25 30 

Leu Gly Gin Lys Gly Val Arg Phe Ala Thr Phe Pro Glu Thr Val Val 

35 40 45 

Pro Tyr Tyr Pro Tyr Phe Ser Ala Val Gin Thr Pro He Gin Leu Leu 

50 55 60 

Ser Gly Thr Glu Tyr Leu Lys Leu Leu Asp Gin Gly Val Thr Val Pro 
65 70 75 80 

Ser Thr Thr Thr Asp Ala He Gly Glu Ala Ala Arg Asn Ala Gly Met 

85 ^O 95 

Val Val Ser He Gly Val Asn Glu Arg Asp Gly Gly Thr Leu Tyr Asn 

100 105 HO 

Ala Gin Leu Leu Phe Asp Ala Asp Gly Thr Leu He Gin Arg Arg Arg 

115 120 125 

Lys He Thr Pro Thr His Tyr Glu Arg Met He Trp Gly Gin Gly Asp 

130 135 140 

Gly Ser Gly Leu Arg Ala Val Lys Ser Gin Val Gly Arg He Gly Gin 
145 150 155 160 

Leu Ala Cys Phe Glu His Asn Asn Pro Leu Ala Arg Tyr Ala Met Met 

165 170 175 

Ala Asp Gly Glu Gin He His Ser Ala Met Tyr Pro Gly Ser Ala Phe 

180 185 190 

Gly Glu Gly Phe Ala Glu Lys Met Glu He Asn He Arg Gin His Ala 

195 200 205 

Leu Glu Ser Gly Cys Phe Val Val Asn Ala Thr Ala Trp Leu Asp Ala 

210 215 220 

Ser Gin Gin Ala Gin He Met Asn Asp Thr Gly Cys Gin He Gly Pro 
225 230 235 240 

He Ser Gly Gly Cys Phe Thr Thr He Val Thr Pro Asp Gly Thr Phe 

245 250 255 

Leu Gly Glu Pro Leu Arg Ser Gly Glu Gly Glu Val He Ala Asp Leu 
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260 






265 










270 






Asp 


Phe 


Lys 
275 


Leu 


He Asp Lys 


Arg 
280 


Lys 


Met 


Leu 


Met 


Asp 
285 


Ser 


Arg 


Gly 


His 


Tyr 
290 


Ser 


Arg 


Pro Glu Leu 
295 


Leu 


Ser 


Leu 


Leu 


He 
300 


Asp 


Arg 


Thr 


Pro 


Thr 


Ala 


His 


He 


His Glu Arg 


Gly 


Ala 


Pro 


Gin 


Thr 


Ser 


Gly 


Ala 


Val 


305 








310 








315 










320 


Gin 


Glu 


Ala 


Thr 


Lys Val Gly Ser 


His 


Ala 


Pro 


Leu 


Leu 


Arg 


Asp 


Gly 










325 






330 










335 




Gin 


Trp 


Asp 


Gin 


Leu Asn Ala 


Gly 


Ala 


Gly 


Arg His 


Thr 


Gly 


Asn 


Gly 








340 






345 










350 






Glu 


Ala 


Gin 
355 


He 


Glu He Met 


Ala 
360 


Ala 


Ala 


His 


Ser 


Gly 
365 


Thr 


Arg 


Gly 


He 


Glu 
370 


Ala 


Lys 


Gly Ala 





















<210> 77 
<211> 1056 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 
<400> 77 

atgccaaccc ccagcgatca tttcaaaatc gccgctgttc aggcctcgcc cgtgtttctg 60 

gaccgggagg ccactgtgga aaaggcctgc cggttgatcg ccgaagccgc aaagcagggc 120 

gtccgcctca tcgtctttcc ggaatcgttc atcccgacct acccggactg ggtatgggcc 180 

gttcccccgg gaagggaaag aatcctgaac cagctgtatt ctgaattcct ggccaatgcc 240 

gtcgatgttc ccggcgcggc gaccgaacaa cttgcccagg ctgcacgaat ggccggcgcc 300 

tatgtgatta tgggcgtcac cgaaagagac acctcggcca gcggggccag cctctacaac 360 

accctgctct acttcagccc cgaaggcatc ctaatgggca aacaccggaa gctggttccc 420 

acggggggcg aacggctggt ctgggcctac ggagacggca gcacgctgga ggtctacgac 480 

actccgctgg gaaagatcgg cgggctgatc tgctgggaga actacatgcc cctggcccgg 540 

tacacgatgt acgcctgggg cacccagatt tacatcgccg ccacctggga ccgcggggaa 600 

ccgtggctct ccaccctgcg gcatatcgca aaggaaggaa gggtctacgt catcgggtgc 660 

tgcatcgccc tgcgccaggg ggatatcccg gaccggttcg agtacaaggg aaaattttat 720 

tccgggtccc gggagtggat caatgagggc gacagcgcca tcgtgaaccc ggacggggaa 780 

ttcatcgccg ggccggtgcg gacgaaggag gagatcctgt atgccgagat agacccccgg 840 

cagatgcggg gccccaagtg gatgctcgat gtggccggtc attacgcccg gccggatatc 900 

ttcgagctca tcgtccaccg gaatccccac ccgatgatca aaatcgccga agacaggggc 960 

acggggatcg cctcaagttt gattcgcccc cgccctaacc ttcccccatc aagggggagg 1020 

aaatcggcaa gaagcaaacg caagcccaaa aaatga 1056 

<210> 78 
<211> 351 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 78 

Met Pro Thr Pro Ser Asp His Phe Lys He Ala Ala Val Gin Ala Ser 

1 5 10 15 

Pro Val Phe Leu Asp Arg Glu Ala Thr Val Glu Lys Ala Cys Arg Leu 

20 25 30 

He Ala Glu Ala Ala Lys Gin Gly Val Arg Leu He Val Phe Pro Glu 

35 40 45 

Ser Phe He Pro Thr Tyr Pro Asp Trp Val Trp Ala Val Pro Pro Gly 
50 55 60 
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Arg Glu 


Arg 


He Leu Asn Gin Leu Tyr 


Ocl 


UJLU 


trllc 


Leu 


Ala 


Asn 


Ala 


65 




70 
















80 


Val Asp Val 


Pro Gly Ala Ala Thr Glu 


Uj.il 


Leu 


Hid 


uin 


nlo 


Ala Arg 








85 




yu 














Met Ala Gly 


Ala Tyr Val 


He Met Gly 


vaj. 


Thr 


Glu 


Arg 


Asp 


Thr 


Ser 






100 




105 










JL1U 






Ala Ser 


Gly 
115 


Ala 


Ser Leu 


Tyr Asn Thr 
120 


Leu 


Leu 


Tyr 


trne 

-IOC 

IZO 


oer 


Pro 


Glu 


Gly He 


Leu 


Met Gly Lys 


His Arg Lys 


Leu 


Val 


Pro 


1 HIT 


oJ.y 


Gly 


Glu 


130 








135 






140 










Arg Leu 


Val 


Trp 


Ala Tyr 


Gly Asp Gly 




Thr 


Leu 


Glu 


Val 


Tyr 


Asp 


145 






150 






155 










160 


Thr Pro 


Leu 


Gly Lys He 


Gly Gly Leu 


x±e 


Cys 


Trp 


Glu 


Asn 


Tyr 


Met 








165 




170 










175 




Pro Leu 


Ala 


Arg Tyr Thr Met Tyr Ala 


Trp 


Gly Thr 


Gin 


He 


Tyr 


He 






180 




185 










190 






Ala Ala 


Thr 


Trp Asp Arg Gly Glu Pro 


Trp 


Leu 


Ser 


Thr 


Leu 


Arg 


His 




195 






200 








205 








He Ala 


Lys 


Glu Gly Arg Val Tyr Val 


x±e 


Gly Cys 


Cys 


He 


Ala 


Leu 


210 








215 






220 










Arg Gin Gly 


Asp 


He Pro 


Asp Arg Phe 


Glu 


T.yr 


Lys 


Gly Lys 


Phe 


Tyr 


225 






230 






235 












Ser Gly Ser 


Arg 


Glu Trp 


He Asn Glu 


Gly 


Asp 


Ser 


Ala 


He 


Val 


Asn 








245 




250 










255 




Pro Asp Gly 


Glu 


Phe He 


Ala Gly Pro 


Val 


Arg 


Thr 


Lys 


Glu 


Glu 


He 






260 




265 










270 






Leu Tyr Ala 


Glu 


He Asp 


Pro Arg Gin 


Met 


Arg Gly 


Pro 


Lys 


Trp 


Met 




275 






280 








285 








Leu Asp 


Val 


Ala 


Gly His 


Tyr Ala Arg 


Pro 


Asp 


He 


Phe 


Glu 


Leu 


He 


290 








295 






300 










Val His 


Arg 


Asn 


Pro His 


Pro Met He 


Lys 


He 


Ala 


Glu Asp Arg 


Gly 


305 




310 






315 










320 


Thr Gly 


He 


Ala 


Ser Ser 


Leu He Arg 


Pro 


Arg 


Pro 


Asn 


Leu 


Pro 


Pro 






325 




330 










335 




Ser Arg Gly 


Arg Lys Ser 


Ala Arg Ser 


Lys 


Arg Lys 


Pro 


Lys 


Lys 





340 345 350 



<210> 79 
<211> 990 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 79 

atgacgaaga aaagcggccg cgattcgttt cgggtcgctg cggtccaggc 
tacctggatc gggaacggag catcgagaaa gcgtgccggc tgatcgacga 
aacgacgccg acctcgtcgt gttccccgaa gccttcgtgc ccggataccc 
tggctcgttc cgccggggcg caccgcagac ttgcgctccg cttatgcgac 
aacgcgatca gcattccgga cgactccacc gatcggctgt gcgccgccgc 
ggcgtcgccg tcgcgatcgg cgtcaacgaa cgcaacaccg aagcgagcgg 
ttcaacacgc tgctctatat cggagcggac ggccggattc tcggaaaaca 
gtaccgaccg gcggcgaacg gctcgtctgg gcatctggcg acggcagcga 
tactcgctgc cgttcggtcg cgtaagcgga ctgatctgct gggagcacta 
gcccggtatg cgctcgccgc gtggggcgaa caggtgcacg tcgctccaac 
ggcgagccgt ggctgtccac gctaaggcac atcgcgaagg aaggccgcgt 
ggctgctgtc aagccgtgcg caaggacgac atccctgaca cgctcgcgtt 
tacctcgcag acgtggacgg ctggatcaac ccaggtggca gcgtcatcat 
ggcaaggtcg tcgcgggacc ggcgatggaa accgaaactg tactgtacgc 
accgagcagc tcgtcggacc gcgctggcag ctcgacgtcg gcggacatta 



ctcgtccgtc 60 

cgcgggacga 120 

actgtgggtg 180 

gctccacgcc 240 

aaaagacgcc 300 

catgagcctg 360 

ccggaagctg 420 

cctcgaggtc 4 80 

catgccgctc 540 

ctgggatcgt 600 

tctcgtcgtc 660 

caagtccaaa 720 

caatcctgac 780 

ggaccttcgc 840 

cgctcgtccg 900 
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gacgtcttcg agctcgtcgt ccatcggcat ccgaagccgt 
gtcaggcgcc gcaagcgtgc acgtcgctaa 



tgattcggac agcgaccggt 



960 
990 



<210> 80 
<211> 329 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 80 



Met 


Thr 


Lys 


Lys Ser 


Gly 


Arg 


Asp 


Ser 


Phe 


Arg 


val 


Ala 


Ala 


Val 


Gin 


1 






5 










10 










15 




Ala 


Ser 


Ser 


Val Tyr 


Leu 


Asp 


Arg 


Glu 


Arg 


Ser 


He 


Glu 


Lys 


Ala 


Cys 








20 








25 










30 






Arg 


Leu 


lie 


Asp Asp 


Ala 


Gly 


Arg 


Asn 


Asp 


Ala 


Asp 


Leu 


Val 


Val 


Phe 






35 








40 










45 








Pro 


Glu 


Ala 


Phe Val 


Pro 


Gly 


Tyr 


Pro 


Leu 


Trp 


Val 


Trp 


Leu 


Val 


Pro 




50 








55 










60 










Pro 


Gly 


Arg 


Thr Ala 


Asp 


Leu 


Arg 


Ser 


Ala 


Tyr 


Ala 


Thr 


Leu 


His 


Ala 


65 








70 










75 










80 


Asn 


Ala 


lie 


Ser lie 


Pro 


Asp 


Asp 


Ser 


Thr 


Asp 


Arg 


Leu 


Cys 


Ala 


Ala 








85 










90 










95 




Ala 


Lys 


Asp 


Ala Gly 


Val 


Ala 


Val 


Ala 


He 


Gly 


Val 


Asn 


Glu 


Arg 


Asn 








100 








105 










110 






Thr 


Glu 


Ala 


Ser Gly 


Met 


Ser 


Leu 


Phe 


Asn 


Thr 


Leu 


Leu 


Tyr 


He 


Gly 






115 








120 










125 








Ala 


Asp 


Gly 


Arg lie 


Leu 


Gly 


Lys 


His 


Arg 


Lys 


Leu 


Val 


Pro 


Thr 


Gly 




130 








135 










140 










oJ.y 


ulU 


Arg 


lieu vol 


irp 


a J- a 


q ar 




Asp 






Asp 


Leu 


ulu 


Val 


145 








150 










155 










160 


Tyr 


Ser 


Leu 


Pro Phe 


Gly 


Arg 


Val 


Ser 


Gly 


Leu 


He 


Cys 


Trp 


Glu 


His 








165 










170 










175 




Tyr 


Met 


Pro 


Leu Ala 


Arg 


Tyr 


Ala 


Leu 


Ala 


Ala 


Trp 


Gly 


Glu 


Gin 


Val 








180 








185 










190 






His 


Val 


Ala 


Pro Thr 


Trp 


Asp 


Arg 


Gly 


Glu 


Pro 


Trp 


Leu 


Ser 


Thr 


Leu 






195 








200 










205 








Arg 


His 


lie 


Ala Lys 


Glu 


Gly 


Arg 


Val 


Leu 


Val 


Val 


Gly 


Cys 


Cys 


Gin 




210 








215 










220 










Ala 


Val 


Arg 


Lys Asp 


Asp 


He 


Pro 


Asp 


Thr 


Leu 


Ala 


Phe 


Lys 


Ser 


Lys 


225 








230 










235 










240 


Xyr 


Leu 


Ala 


Asp Val 


Asp 


Gly 


Trp 


He 


Asn 


Pro 


Gly 


Gly 


Ser 


Val 


He 








245 










250 










255 




lie 


Asn 


Pro 


Asp Gly 


Lys 


Val 


Val 


Ala 


Gly 


Pro 


Ala 


Met 


Glu 


Thr 


Glu 








260 








265 










270 






Thr 


Val 


Leu 


Tyr Ala 


Asp 


Leu 


Arg 


Thr 


Glu 


Gin 


Leu 


Val 


Gly 


Pro 


Arg 






275 








280 










285 








Trp 


Gin 


Leu 


Asp Val 


Gly 


Gly 


His 


Tyr 


Ala 


Arg 


Pro 


Asp 


Val 


Phe 


Glu 




290 








295 










300 










Leu 


Val 


Val 


His Arg 


His 


Pro 


Lys 


Pro 


Leu 


He 


Arg 


Thr 


Ala 


Thr 


Gly 


305 








310 










315 










320 


Val 


Arg 


Arg 


Arg Lys 


Arg 


Ala 


Arg 


Arg 

















<210> 81 
<211> 993 
<212> DNA 
<213> Unknown 

<220> 



325 
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<223> Obtained from an environmental sample 



60 
120 
180 
240 
300 
360 



540 
600 
660 
720 
780 
840 
900 
960 
993 



atgaaagtcg tcaaagccgc cgctgtccag ttcagcccgg tgctctatag ccgcgaagcg 
accgtcgcca aggtcgtccg gaaaatccac gagctcggtc agaaaggcgt gcagttcgcc 
acctttcctg aaacggtcgt gccttattac ccttacttcg cggccgtcca sacgggcatc 
gagctcttgt cgggcaccga acatctgcgc ctgctcgaac aggccgtgac tgtgccctcc 
gctgcgaccg atgcaatcgg cgaagccgcg cgacaggccg gcatggtcgt gtccatcggc 
atcaataagc gtgacggcgg cacgctttac aacacgcaac tgctcttcga tgccgacggt 
acgctgatcc agcgccgccg caagatcacg ccgacccatt tcgaacgcat gatctggggg 20 
cagggagatg gctcgggctt gcgtgcagtc gacagcgcag tcggccgcat cggccagctc 480 
gca?gcttcg agcacaacaa cccgcttgca cgttacgcaa tgatcgccga cggcgagcag 540 
atccattcag cgatgtaccc tggctcggcc tttggcgagg gcttcgccca gcgtatggag 
atcaacatcc gccagcatgc gctcgagtcc gccgctttcg tcgtcaacgc aacggcgtgg 
cttgacgccg accagcaggc gcaaatcatg aaggacaccg gttgtggaat cggtccgatc 
tcgggcggct gcttcaccac gatcgtttct cctgacggta tgctgatggc cgatccgctt 
cgctcgggcg aaggcgaagt gattgtcgat ctcgacttca cgcagatcga ccgccgcaag 
atqctgatgg actcggccgg ccactacaac cgccctgaac tgctgagtct gatgatcgac 
cgtacgccgg ctgcgcatgt tcacgaacgc gcttcgcgcc cgatgaccgt cgacgaccag 
agttccggcg atctgcgcac ccaggttgca tga 

<210> 82 
<211> 330 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 

Met°Lys 2 Val Val Lys Ala Ala Ala Val Gin Phe Ser Pro Val Leu Tyr 

1 5 10 15 

Ser Arg Glu Ala Thr Val Ala Lys Val Val Arg Lys lie His Glu Leu 

20 25 30 

Glv Gin Lys Gly Val Gin Phe Ala Thr Phe Pro Glu Thr Val Val Pro 

35 40 45 

Tyr Tyr Pro Tyr Phe Ala Ala Val Gin Thr Gly lie Glu Leu Leu Ser 

50 55 60 

Gly Thr Glu His Leu Arg Leu Leu Glu Gin Ala Val Thr Val Pro Ser 
70 

Ala Ala Thr Asp Ala He Gly Glu Ala Ala Arg Gin Ala Gly Met Val 

85 90 95 

Val Ser lie Gly Val Asn Glu Arg Asp Gly Gly Thr Leu Tyr Asn Thr 

100 105 . 

Gin Leu Leu Phe Asp Ala Asp Gly Thr Leu He Gin Arg Arg Arg Lys 

115 120 I 25 

He Thr Pro Thr His Phe Glu Arg Met He Trp Gly Gin Gly Asp Gly 

130 135 i4 ° , „ 

Ser Gly Leu Arg Ala Val Asp Ser Ala Val Gly Arg He Gly Gin Leu 
145 150 155 

Ala Cys Phe Glu His Asn Asn Pro Leu Ala Arg Tyr Ala Met lie Ala 

165 I 70 , 

Asp Gly Glu Gin He His Ser Ala Met Tyr Pro Gly Ser Ala Phe Gly 

180 18J> iyu 

Glu Gly Phe Ala Gin Arg Met Glu He Asn He Arg Gin His Ala Leu 

195 200 205 

Glu Ser Ala Ala Phe Val Val Asn Ala Thr Ala Trp Leu Asp Ala Asp 

210 215 220 

Gin Gin Ala Gin He Met Lys Asp Thr Gly Cys Gly He Gly Pro He 

7 ?S 230 235 240 

Ser Gly Gly Cys Phe Thr Thr He Val Ser Pro Asp Gly Met Leu Met 
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245 










250 








255 




Ala Asp Pro Leu 


Arg 


Ser 


Gly Glu Gly Glu Val 


lie 


Val 


Asp Leu 


Asp 


260 










265 










270 




Phe Thr Gin lie 


Asp 


Arg 


■Arg 


Lys 


Met 


Leu 


Met 


Asp 


Ser Ala Gly His 


275 








280 










285 






Tyr Asn Arg Pro 


Glu 


Leu 


Leu 


Ser 


Leu 


Met 


lie 


Asp Arg 


Thr Pro 


Ala 


290 






295 










300 








Ala His Val His 


Glu 


Arg 


Ala 


Ser 


Arg 


Pro 


Met 


Thr 


Val 


Asp Asp 


Gin 


305 




310 










315 








320 


Ser Ser Gly Asp 


Leu 
325 


Arg 


Thr 


Gin 


Val 


Ala 
330 













<210> 83 
<211> 1071 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 83 

atgatgagtt cagcccgtgt aataaaactc gccgcagcac agctttcacc tgtgctgccg 60 

ggggagtcca caaatagccg cgacggcacc attgccaaag tcgtcgcggc gattgcggag 120 

gctgcgcgcg ccggcgcgca gctgatcgtg tttcccgaaa cggtggtgcc gtattacccg 180 

tatttctcgt tcattacgcc ggcggtgacg atgggggcgg agcatttgcg cttgtacgat 240 

cagtctgtcg tggtgccgag cgccgccact gatactgttg ccgccgctgc aaaaaaacac 300 

agcatggtgg tcgtgctcgg tattaacgaa cgcgatcacg gcacgctcta caacgcgcaa 360 

ttaattttcg atgcgagcgg cgaattatta ttaaaacgcc gaaaaattac cccgacctat 420 

cacgagcgca tggtgtgggg tcagggcgac ggcagcggtt tgaaaaccgt cgacaccgcg 480 
atcggccgtg tcggtgcgct cgcctgctgg gaacattaca acccattggc gcgttacagc * 540 

ctgatggccc agcacgaaga aattcattgc agtcaatttc cggggtcatt ggtcgggcca 600 

attttcgccg agcaaatgga agtgacaatg cgccaccacg cgctcgaatc cggttgcttc 660 

gtcgttaatg caacggcgtg gttatcggaa gcgcaaattc aatcgatcag cagcgatccc 720 

gcgatgcaaa aagcactgcg cggcggttgc tacaccgcaa ttatttcgcc cgaaggcaaa 780 

catctgtgcg agccgctacg cgaaggtgaa ggtttgattt ttgccgaagc cgatatggcg 840 

ctcattacca aacgcaaacg catgatggat tcggttggtc attacgcgcg acccgaattg 900 

ctgtcgctgt taatcgacca tcgcgccacc acaccattgc atagcgtcac cgcgagtgat 960 

gccgccgccg taaaaaatac tcggagttcc gctcatgaat cagccgatag tgaaaccatc 1020 

cgcgagtcag ttaataacgg aactccaatc gcacggcttg cgcctagttg a 1071 

<210> 84 
<211> 356 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 84 












Met Met 


Ser 


Ser 


Ala Arg Val He Lys Leu Ala Ala 


Ala Gin 


Leu 


Ser 


1 






5 10 




15 




Pro Val 


Leu 


Pro 


Gly Glu Ser Thr Asn Ser Arg Asp 


Gly Thr 


He 


Ala 






20 


25 


30 






Lys Val 


Val 


Ala 


Ala He Ala Glu Ala Ala Arg Ala 


Gly Ala Gin 


Leu 


35 




40 


45 






He Val 


Phe 


Pro 


Glu Thr Val Val Pro Tyr Tyr Pro 


Tyr Phe 


Ser 


Phe 


50 






55 60 








He Thr 


Pro 


Ala 


Val Thr Met Gly Ala Glu His Leu 


Arg Leu 


Tyr 


Asp 


65 






70 75 






80 


Gin Ser 


Val 


Val 


Val Pro Ser Ala Ala Thr Asp Thr 


Val Ala 


Ala 


Ala 








85 90 




95 





Page 55 



WO 03/097810 



PCT/US03/15712 



Ala Lys 


Lys 


His 


Ser Met 


Val 


Val 


Val 


Leu 


Gly He 


Asn Glu Arg 


Asp 






100 








105 








110 




His Gly 


Thr 


Leu 


Tyr Asn 


Ala 


Gin 


Leu 


He 


Phe Asp 


Ala Ser Gly 


Glu 




115 








120 










125 




Leu Leu 


Leu 


Lys 


Arg Arg 


Lys 


He 


Thr 


Pro 


Thr 


Tyr 


His Glu Arg 


Met 


130 








135 










140 






Val Trp 


Gly 


Gin 


Gly Asp 


Gly 


Ser 


Gly 


Leu 


Lys 


Thr 


Val Asp Thr 


Ala 


145 






150 










155 






160 


lie Gly 


Arg 


Val 


Gly Ala 


Leu 


Ala 


Cys 


Trp 


Glu 


His 


Tyr Asn Pro 


Leu 








165 








170 






175 




Ala Arg 


Tyr 


Ser 


Leu Met 


Ala 


Gin 


His 


Glu 


Glu 


He 


His Cys Ser 


Gin 






180 








185 








190 




Phe Pro 


Gly 


Ser 


Leu Val 


Gly 


Pro 


He 


Phe 


Ala 


Glu 


Gin Met Glu 


Val 




195 








200 










205 




Thr Met 


Arg 


His 


His Ala 


Leu 


Glu 


Ser 


Gly 


Cys 


Phe 


Val Val Asn 


Ala 


210 








215 










220 






Thr Ala 


Trp 


Leu 


Ser Glu 


Ala 


Gin 


He 


Gin 


Ser 


He 


Ser Ser Asp 


Pro 


225 




230 










235 






240 


Ala Met 


Gin 


Lys 


Ala Leu 


Arg 


Gly 


Gly 


Cys 


Tyr 


Thr 


Ala He He 


Ser 








245 








250 






255 




Pro Glu 


Gly 


Lys 


His Leu 


Cys 


Glu 


Pro 


Leu 


Arg Glu 


Gly Glu Gly 


Leu 






260 








265 








270 




lie Phe 


Ala 


Glu 


Ala Asp 


Met 


Ala 


Leu 


He 


Thr 


Lys 


Arg Lys Arg 


Met 




275 








280 










285 




Met Asp 


Ser 


Val 


Gly His 


Tyr 


Ala 


Arg 


Pro 


Glu 


Leu 


Leu Ser Leu 


Leu 


290 








295 










300 






lie Asp 


His 


Arg 


Ala Thr 


Thr 


Pro 


Leu 


His 


Ser 


Val 


Thr Ala Ser 


Asp 


305 






310 










315 






320 


Ala Ala 


Ala 


Val 


Lys Asn 


Thr 


Arg 


Ser 


Ser 


Ala 


His 


Glu Ser Ala 


Asp 








325 








330 






335 




Ser Glu 


Thr 


He 


Arg Glu 


Ser 


Val 


Asn 


Asn 


Gly Thr 


Pro He Ala 


Arg 






340 








345 








350 




Leu Ala 


Pro 


Ser 





















355 



<210> 85 
<211> 1014 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 85 

atgggtctgg ttcatcagaa 
gacctcgatg cgacggtgga 
gcgaagctga tcgcgtttcc 
ggggcgcccg cctgggcgat 
tcatttgaca gcccgcaggc 
gcggtgatcg gcgtctccga 
ggtcccgacg gcgagaccat 
accgtgttcg gcgagggcga 
cggctcggtg caatgtgctg 
gcccagaacg agcaggttca 
gcccatgcgc ttggctggga 
tcatgtttct tcctcggccc 
gattcccccg aaaagcacgc 
ccggacggga gttcgcttgc 
gatatcgatc tcggcatgat 
tccaggccgg acgtcacgcg 
ttttcattgc cgatcgatgc 



atacaaggtt gcggtggttc aggcggcgcc ggtctttctc 60 

caagacgatc gccctgatcg agcaggccgc agcacagggc 120 

cgagaccttc attcccggat atccgtggca gatctggctt 180 

cggccgtggc ttcgtgcagc gctatttcga taactcgttg 240 

cgaaaaaatt cgcaaggccg tcaagcgcgc caagctgacc 300 

acgcgacggc ggcagcctct atatcggcca atggctgatc 360 

tgcgaagcgc cgcaagctgc ggccgaccca tgccgaacgc 420 

cggcagcgac ctcgccgtcc atgatcgcgc cgacgtggga 480 

ctgggagcat ctgcagccgc tgtcgaaata cgcgatgtac 540 

cgtcggcgcc tggccgagct tctcattgta cgacccattc 600 

agtaaacaac gcggcgagca aggtttatgc tgtcgagggc 660 

gtgcgcggtg gtctcgcagg ccatgatcga cgagctctgc 720 

cttcctgcac gctggcggcg gccacgcggt aatctatggg 780 

cgacaaactt ccacccgatc aggagggcat tctgtatgcc 840 

cggcgtggca aagaacgccg ccgaccccgc aggacactat 900 

gctgctgctc aacacttccc gcgccaatcg cgtcgagcat 960 

cgaggtcatg agcgaaatca gacttcaggc ctga 1014 
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<210> 86 
<211> 337 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 86 

Met Gly Leu Val His Gin Lys Tyr Lys Val Ala Val Val Gin Ala Ala 

1 5 10 " 15 

Pro Val Phe Leu Asp Leu Asp Ala Thr Val Asp Lys Thr lie Ala Leu 

20 25 30 

lie' Glu Gin Ala Ala Ala Gin Gly Ala Lys Leu He Ala Phe Pro Glu 

35 40 45 

Thr Phe He Pro Gly Tyr Pro Trp Gin He Trp Leu Gly Ala Pro Ala 

50 55 60 

Trp Ala He Gly Arg Gly Phe Val Gin Arg Tyr Phe Asp Asn Ser Leu 
65 70 75 80 

Ser Phe Asp Ser Pro Gin Ala Glu Lys He Arg Lys Ala Val Lys Arg 

85 90 95 

Ala Lys Leu Thr Ala Val He Gly Val Ser Glu Arg Asp Gly Gly Ser 

100 105 HO 

Leu Tyr He Gly Gin Trp Leu He Gly Pro Asp Gly Glu Thr He Ala 

115 120 125 

Lys Arg Arg Lys Leu Arg Pro Thr His Ala Glu Arg Thr Val Phe Gly 

130 135 140 

Glu Gly Asp Gly Ser Asp Leu Ala Val His Asp Arg Ala Asp Val Gly 
145 150 155 160 

Ara Leu Gly Ala Met Cys Cys Trp Glu His Leu Gin Pro Leu Ser Lys 

165 170 175 

Tyr Ala Met Tyr Ala Gin Asn Glu Gin Val His Val Gly Ala Trp Pro 

180 185 190 

Ser Phe Ser Leu Tyr Asp Pro Phe Ala His Ala Leu Gly Trp Glu Val 

195 200 205 

Asn Asn Ala Ala Ser Lys Val Tyr Ala Val Glu Gly Ser Cys Phe Phe 

210 215 220 

Leu Gly Pro Cys Ala Val Val Ser Gin Ala Met He Asp Glu Leu Cys 
225 230 235 240 

Asp Ser Pro Glu Lys His Ala Phe Leu His Ala Gly Gly Gly His Ala 

245 250 255 

Val He Tyr Gly Pro Asp Gly Ser Ser Leu Ala Asp Lys Leu Pro Pro 

260 265 270 

Asp Gin Glu Gly He Leu Tyr Ala Asp He Asp Leu Gly Met He Gly 

275 280 285 

Val Ala Lys Asn Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp 

290 295 300 

Val Thr Arg Leu Leu Leu Asn Thr Ser Arg Ala Asn Arg Val Glu His 
305 310 315 320 

Phe Ser Leu Pro He Asp Ala Glu Val Met Ser Glu He Arg Leu Gin 
325 330 335 

Ala 



<210> 87 
<211> 1062 
<212> DNA 
<213> Unknown 

<220> 
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<223> Obtained from an environmental sample 
<400> 87 

atggcggaat cgaagctgaa ggtcgccgca attcaagttg cgcccgtgtt catggatcgc 60 

gatgccacga tcgcccgcgc ctgcgagcgg atcgccgaag ccgcccgcgc cggcgcggag 120 

ttggtggtct ttcccgaggc attcgtgccc gggtatcccg actggatctg ggtggcgcgg 180 

ccaagccaac gcaaactgct caatgatctt tacgcgcacc tcgtctcgca gtcggtcgac 240 
gtgccgtcgg cctccgtgga tcgtttgcgc gacgcggctc gcgacggcgg ggtcacggtg " 300 

gtgatcggcg tcaacgagcg caacaccgaa gcgagcggcg cgagcctcta caacaccgcg 360 

ctcgtgatcg gtccactggg gcagctgatc ggccgccacc gcaagcttgt gccgaccggg 420 

ccggagcgca tggtgtgggc gcagggcgac ggcagcacgc tcgacgtcta cgacacaccc 480 

gtcggcaagc tttcgacgtt gatctgctgg gagaactaca tgccgctcgc gcgctacgcc 540 

atggcggcgt ggggcgcgcg catccacgtc gccggcacgt gggaccgcgg cgagccgtgg 600 

atctcgacca tgcgtcatgt ggcgacggag ggccgcgtat tcgtgattag ctgttgcatg 660 

gcgctgcgca aacgagacat tcccgccgag ctcgagttcg cgatgctcta tcccgacggg 720 

cgcgaatgga tcaacgccgg tgattcgctg gtcgtgaatc ccgctggcca gatcatcgct 780 

gggccgttgc acgagcagga aggaatcctc tacgccgagc tcgagcgcaa tcagatgacc 840 

ggtccgcgtt ggatgttcga cgccgccggc cattacgcgc gaccggacgt cttccaactc 900 

acggtaaacc gctccccgcg cccgatgctg cgggaggcgg gggcaaagac gagtgaggca 960 

aacacgagag atgccgtacc catggacagc acgccctcga gatcgcggcc ccgcgcggtg 1020 

gcgcgaaagg ccgcacgcac cggtcgctcc aagcggcggt ga 1062 

<210> 88 
<211> 353 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 88 

Met Ala Glu Ser Lys Leu Lys Val Ala Ala lie Gin Val Ala Pro Val 

15 10 15 

Phe Met Asp Arg Asp Ala Thr lie Ala Arg Ala Cys Glu Arg He Ala 

20 25 30 

Glu Ala Ala Arg Ala Gly Ala Glu Leu Val Val Phe Pro Glu Ala Phe 

•35 40 45 

Val Pro Gly Tyr Pro Asp Trp He Trp Val Ala Arg Pro Ser Gin Arg 

50 55 60 

Lys Leu Leu Asn Asp Leu Tyr Ala His Leu Val Ser Gin Ser Val Asp 
65 70 75 80 

Val Pro Ser Ala Ser Val Asp Arg Leu Arg Asp Ala Ala Arg Asp Gly 

85 90 95 

Gly Val Thr Val Val He Gly Val Asn Glu Arg Asn Thr Glu Ala Ser 

100 105 HO 

Gly Ala Ser Leu Tyr Asn Thr Ala Leu Val He Gly Pro Leu Gly Gin 

115 120 125 

Leu He Gly Arg His Arg Lys Leu Val Pro Thr Gly Pro Glu Arg Met 

130 135 140 

Val Trp Ala Gin Gly Asp Gly Ser Thr Leu Asp Val Tyr Asp Thr Pro 
145 150 155 160 

Val Gly Lys Leu Ser Thr Leu He Cys Trp Glu Asn Tyr Met Pro Leu 

165 170 175 

Ala Arg Tyr Ala Met Ala Ala Trp Gly Ala Arg He His Val Ala Gly 

180 185 190 

Thr Trp Asp Arg Gly Glu Pro Trp lie Ser Thr Met Arg His Val Ala 

195 200 205 

Thr Glu Gly Arg Val Phe Val He Ser Cys Cys Met Ala Leu Arg Lys 

210 215 220 

Arg Asp He Pro Ala Glu Leu Glu Phe Ala Met Leu Tyr Pro Asp Gly 
225 230 235 240 
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Arg Glu Trp 


He 


Asn 


Ala 


Gly 


Asp 




245 








Gin He He 


Ala 
260 


Gly 


Pro 


Leu 


His 


Glu Leu Glu 


Arg 


Asn 


Gin 


Met 


Thr 


275 










280 


Ala Gly His 


Tyr 


Ala 


Arg 


Pro 


Asp 


290 








295 




Ser Pro Arg 


Pro 


Met 


Leu 


Arg 


Glu 


305 






310 






Asn Thr Arg 


Asp 


Ala 
325 


Val 


Pro 


Met 


Pro Arg Ala 


Val 


Ala 


Arg 


Lys 


Ala 



340 



Arg 



Ser 


Leu Val Val Asn 


Pro 


Ala 


Gly 




250 




255 




Glu 


Gin Glu Gly He 


Leu 


Tyr 


Ala 


265 




270 






Gly 


Pro Arg Trp Met 


Phe 


Asp 


Ala 




285 








Val 


Phe Gin Leu Thr 


Val 


Asn 


Arg 




300 








Ala 


Gly Ala Lys Thr 


Ser 


Glu 


Ala 




315 






320 


Asp 


Ser Thr Pro Ser 


Arg 


Ser 


Arg 




330 




335 




Ala 


Arg Thr Gly Arg 


Ser 


Lys 


Arg 


345 




350 







<210> 89 
<211> 918 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 89 

atgaatacca aagaagtaaa ggtcgcagcc gctcaatttg ccccacattt tctgaatttg 60 

agcaaaacgg tggaaaaaac ctgcaacttg atttccgaag caggcaaaaa tggagcaaag 120 

ctcattgtat ttccggaagc cttcctctct ggttatcccg attgggtctg gttaattccc 180 

aatggaaatt caacaatgct ggatgattta tatcaggaat tggttgagaa cgctgtaaca 240 

atccctgatt caacaacaca gaaactctgt caggcagcaa aagatgccgg ggtatatgtc 300 

gcagtcggta tccatgaaag aaatgcagaa gcaagtggct tcacactttt caataccctt 360 

ctatacatta atgatcaagg cagcatcatt ggaaaacacc gaaaactgat cccaacaggg 420 

ggcgaacgcc tggtctgggg gcagggtaat ggggatacgc ttgctgcatt cgatacacac 480 

tttggcaaat tgggaggatt gctttgctgg gaaaactaca tgcccctggc tcggcaagct 540 

atgtacgcag ttgggactga agtttatgtt gccccaacct gggactccag tgagaattgg 600 

ttgctgagta tgcgccatat agccagagag ggcggcatgt ttgtgatcaa tgtttgccag 660 

gctgtccgaa aagacgatat tcctgaccgc tatgcattca agcaactcta ttctggtaat 720 

tcagaatgga tcaatagcgg caacagttgc atcatcaatc cgcgcggtga aatcattgcc 780 

ggaccatcct caaacaggca agaaatactc tacgcagatt tagatctgag tttgattaca 840 

aaatctaaac gcatgttcga tgttaccggg cattatgccc ggccggatgt gtttagatat 900 
gaaatcaaaa aaagctag 



<210> 90 
<211> 305 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 90 

Met Asn Thr Lys Glu Val Lys Val 

1 5 
Phe Leu Asn Leu Ser Lys Thr Val 
20 

Glu Ala Gly Lys Asn Gly Ala Lys 

35 40 
Leu Ser Gly Tyr Pro Asp Trp Val 

50 55 
Thr Met Leu Asp Asp Leu Tyr Gin 



Ala Ala Ala Gin Phe Ala Pro His 

10 15 
Glu Lys Thr Cys Asn Leu He Ser 
25 30 
Leu He Val Phe Pro Glu Ala Phe 
45 

Trp Leu He Pro Asn Gly Asn Ser 
60 

Glu Leu Val Glu Asn Ala Val Thr 
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65 

He Pro Asp 

Gly Val Tyr 

Gly Phe Thr 
115 

He He Gly 

130 
Val Trp Gly 
145 

Phe Gly Lys 

Ala Arg Gin 

Thr Trp Asp 
195 

Arg Glu Gly 

210 
Asp Asp He 
225 

Ser Glu Trp 

Glu He He 

Asp Leu Asp 
275 

Thr Gly His 
290 

Ser 
305 



70 75 80 

Ser Thr Thr Gin Lys Leu Cys Gin Ala Ala Lys Asp Ala 

85 90 95 

Val Ala Val Gly He His Glu Arg Asn Ala Glu Ala Ser 
100 105 110 

Leu Phe Asn Thr Leu Leu Tyr He Asn Asp Gin Gly Ser 

120 125 
Lys His Arg Lys Leu He Pro Thr Gly Gly Glu Arg Leu 

135 140 
Gin Gly Asn Gly Asp Thr Leu Ala Ala Phe Asp Thr His 
150 155 160 

Leu Gly Gly Leu Leu Cys Trp Glu Asn Tyr Met Pro Leu 

165 170 175 

Ala Met Tyr Ala Val Gly Thr Glu Val Tyr Val Ala Pro 
180 185 190 

Ser Ser Glu Asn Trp Leu Leu Ser Met Arg His He Ala 

200 *205 
Gly Met Phe Val He Asn Val Cys Gin Ala Val Arg Lys 

215 220 
Pro Asp Arg Tyr Ala Phe Lys Gin Leu Tyr Ser Gly Asn 
230 235 240 

He Asn Ser Gly Asn Ser Cys He He Asn Pro Arg Gly 

245 250 255 

Ala Gly Pro Ser Ser Asn Arg Gin Glu He Leu Tyr Ala 
260 265 270 

Leu Ser Leu He Thr Lys Ser Lys Arg Met Phe Asp Val 

280 285 
Tyr Ala Arg Pro Asp Val Phe Arg Tyr Glu He Lys Lys 
295 300 



<210> 91 
<211> 939 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 91 

atgaccaaaa 

gacagagcgg 

cccgaagcct 

atgggaaaaa 

gaggggctta 

tatcaagagg 

gatgcggacg 

atggtttggg 

atcggcacgc 

caaaacatcg 

ctgcagcata 

gcctctgaca 

tgggtaaacc 

atgcatcagg 

cgtcggaagt 

aatcgcaccg 



tcgctgtcat 
tcgacttgat 
ggcttgcagg 
ctgatgagct 
gaccattgca 
tggatggcgc 
ggcgactggc 
gttttggcga 
tgatttgctg 
aaatctatgt 
tcgcgcgcga 
tcccggacga 
ctggcgatgc 
aaaaggggct 
tcgatgcgag 
cgatgcggcc 



tcaagaacct 
tgccaatgct 
ttaccccacc 
ttacgcgcgt 
ggaggccgca 
gggaagcagc 
caacaatcat 
cggttcaggc 
ggaaaactac 
tgccccgact 
aggtggctgc 
cgttccccat 
ggtggtttat 
tctcatcgca 
cgggcactac 
agttgatttc 



ccggtctatc 
gcaagcaagg 
ttcgtctggc 
ttgctcgcca 
aaggagcatg 
agcacgatct 
cgcaagttga 
ctgaacgtcg 
atgccgttag 
tgggacagtg 
tgggtcatcg 
cgggatgagc 
aagccatttg 
gagttggacg 
gctcgccccg 
acgaattag 



tgaatctgag 
ggtgtgagtt 
gtcttgcgcc 
actcggtcga 
gcgttgtcat 
tcaacagctg 
tgcccaccaa 
ttgacaccgc 
cgcgctacgc 
gtgccatgtg 
gatgtgcaac 
tattcccgaa 
gcggcattgt 
tcgccgctgt 
atgtcttcaa 



taaatcgatg 
gattgtgttt 
gggcagcgga 
ccgtagcaaa 
tgtgctgggt 
tgcgattatt 
tccggagagg 
ggtgggcagg 
gctgtatgtc 
gcaggcgacc 
gtcgctggaa 
caaagacgaa 
ggccggcccc 
tcagtcgtca 
actgcatgtg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
939 



<210> 92 
<211> 312 
<212> PRT 
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<213> Unknown 
<220> 

<223> Obtained from an environmental sample 

Me?°Thr 2 Lys lie Ala Val He Gin Glu Pro Pro Val Tyr Leu Asn Leu 

1 5 10 15 

Ser Lys Ser Met Asp Arg Ala Val Asp Leu lie Ala Asn Ala Ala Ser 

20 25 30 

Lvs Gly Cys Glu Leu He Val Phe Pro Glu Ala Trp Leu Ala Gly Tyr 

35 40 4 5 

Pro Thr Phe Val Trp Arg Leu Ala Pro Gly Ser Gly Met Gly Lys Thr 

50 55 60 

Asp Glu Leu Tyr Ala Arg Leu Leu Ala Asn Ser Val Asp Arg Ser Lys 
65 70 75 

Glu Gly Leu Arg Pro Leu Gin Glu Ala Ala Lys Glu His Gly Val Val 

85 90 95 

He Val Leu Gly Tyr Gin Glu Val Asp Gly Ala Gly Ser Ser Ser Thr 

100 105 HO 

He Phe Asn Ser Cys Ala He He Asp Ala Asp Gly Arg Leu Ala Asn 

115 120 125 

Asn His Arg Lys Leu Met Pro Thr Asn Pro Glu Arg Met Val Trp Gly 

130 135 140 

Phe Gly Asp Gly Ser Gly Leu Asn Val Val Asp Thr Ala Val Gly Arg 
145 150 155 160 

He Glv Thr Leu He Cys Trp Glu Asn Tyr Met Pro Leu Ala Arg Tyr 

165 170 175 

Ala Leu Tyr Val Gin Asn He Glu He Tyr Val Ala Pro Thr Trp Asp 

1B0 185 190 

Ser Gly Ala Met Trp Gin Ala Thr Leu Gin His He Ala Arg Glu Gly 

195 200 205 

Glv Cvs Trp Val He Gly Cys Ala Thr Ser Leu Glu Ala Ser Asp He 

210 215 220 

Pro Asp Asp Val Pro His Arg Asp Glu Leu Phe Pro Asn Lys Asp Glu 
225 230 235 240 

Trp Val Asn Pro Gly Asp Ala Val Val Tyr Lys Pro Phe Gly Gly He 

245 250 255 

Val Ala Gly Pro Met His Gin Glu Lys Gly Leu Leu He Ala Glu Leu 

260 265 270 

Asp Val Ala Ala Val Gin Ser Ser Arg Arg Lys Phe Asp Ala Ser Gly 

275 280 285 

His Tyr Ala Arg Pro Asp Val Phe Lys Leu His Val Asn Arg Thr Ala 

290 295 300 

Met Arg Pro Val Asp Phe Thr Asn 
305 310 

<210> 93 
<211> 978 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 

aJgcccaLa tcaaagccgc tgccgtgcaa atcagcccgg tgctttacag tcgcgaaggc 60 

accgtggaca aggtc?gtca acagatcatc gacctcggtc ggcaaggcgt gcagttcgcc 120 

ScttSLgg aaacgg?ggt gccttactac ccgtactttt cgtttgtgca ^cggccttt 180 

gccatgggcg cacagcacct caagttgctg gatcaatcgg tgacagtgcc 9tcggccgcc 240 

accttggcca tcggtgaagc ttgcaagcaa gcagggatag tggtgtccat cggcgtcaac 300 
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gaacgcgatg gcggtacgat ctacaacgcg caattactct tcgatgccga cggcagcctg 360 

attcagcatc gccgcaaaat caccccgacc tatcacgaac gcatggtctg ggggcaaggc 420 

gatggttccg gcctgcgcgc catcgacagt gcagtggggc gcattggctc cctggcctgt 4 80 

tgggagcatt acaacccgct ggctcgttat gccttgatgg ccgatggcga gcagatccac 540 

gccgcgatgt ttcccggctc gctggtgggc gacatttttg ccgagcagat cgaagtcacc 600 

atccgccatc acgccttgga gtccggctgt ttcgtggtca acgccaccgc ctggctggac 660 

gccgatcagc agggccaaat catgcaagac accggttgca gcctcggccc gatctcgggt 720 

ggctgcttca ccgccatcgt ttcccctgaa ggcaagttgc tcggtgagcc gctgcgttcc 780 

ggcgaagggg tggtgatcgc cgatctcgat ctggcactga tcgataagcg taaacggatg 840 

atggattcgg tcgggcatta cagtcgcccg gaactgctca gcctgttgat cgaccgcacg 900 

cccacagcgc atgtgcatga acgcagcgcg cacctggtgg ctgtcgctac cgaggagttc 960 

gatcatgcaa accaatga 978 

<210> 94 

<2U> 325 

<212> PRT 

<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 94 

Met Pro He He Lys Ala Ala Ala Val Gin He Ser Pro Val Leu Tyr 

15 10 15 

Ser Arg Glu Gly Thr Val Asp Lys Val Cys Gin Gin He He Asp Leu 

20 25 30 

Gly Arg Gin Gly Val Gin Phe Ala Val Phe Pro Glu Thr Val Val Pro 

35 40 45 

Tyr Tyr Pro Tyr Phe Ser Phe Val Gin Pro Ala Phe Ala Met Gly Ala 

50 55 60 

Gin His Leu Lys Leu Leu Asp Gin Ser Val Thr Val Pro Ser Ala Ala 
65 70 75 80 

Thr Leu Ala He Gly Glu Ala Cys Lys Gin Ala Gly He Val Val Ser 

85 90 95 

He Gly Val Asn Glu Arg Asp Gly Gly Thr He Tyr Asn Ala Gin Leu 

100 105 110 

Leu Phe Asp Ala Asp Gly Ser Leu He Gin His Arg Arg Lys He Thr 

115 120 125 

Pro Thr Tyr His Glu Arg Met Val Trp Gly Gin Gly Asp Gly Ser Gly 

130 135 140 

Leu Arg Ala He Asp Ser Ala Val Gly Arg He Gly Ser Leu Ala Cys 
145 150 155 160 

Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Leu Met Ala Asp Gly 

165 170 175 

Glu Gin He His Ala Ala Met Phe Pro Gly Ser Leu Val Gly Asp He 

180 185 190 

Phe Ala Glu Gin He Glu Val Thr He Arg His His Ala Leu Glu Ser 

195 200 205 

Gly Cys Phe Val Val Asn Ala Thr Ala Trp Leu Asp Ala Asp Gin Gin 

210 215 220 

Gly Gin He Met Gin Asp Thr Gly Cys Ser Leu Gly Pro He Ser Gly 
225 230 235 240 

Gly Cys Phe Thr Ala He Val Ser Pro Glu Gly Lys Leu Leu Gly Glu 

245 250 255 

Pro Leu Arg Ser Gly Glu Gly Val Val He Ala Asp Leu Asp Leu Ala 

260 265 270 

Leu He Asp Lys Arg Lys Arg Met Met Asp Ser Val Gly His Tyr Ser 

275 280 285 

Arg Pro Glu Leu Leu Ser Leu Leu He Asp Arg Thr Pro Thr Ala His 

290 295 300 

Val His Glu Arg Ser Ala His Leu Val Ala Val Ala Thr Glu Glu Phe 
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305 310 315 320 

Asp His Ala Asn Gin 
325 

<210> 95 
<211> 966 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 95 

atgtccaacg agaataacat tgctacattc aaagttgccg cagtccaggc cacacctgtg 
tttcttgatc gtgaagcaac catcgacaaa gcttgcgcgt tgattgccac tgctggcagt 
gaaggagcgc gcctgattgt gtttccagaa gcattcatcc caacttatcc tgaatgggta 180 
tggggtattc cctccggtga gcaaggttta ctcaacgaac tctatgcaga gttgctcacc 240 
aatgcggtca ccattcccag cgatgcgact gacaggctgt gcgaggctgc gcagcttgcg 300 
aatgcctacg tagtgatggg catgagcgaa cggaacgtcg aggcgagtgg cgcaagcctg 360 
tataatacgc tgttgtacat aaatgcgcag ggggagattt tagggaaaca tcgaaagctg 420 
gtgccaacgg gcggcgaacg cctggtatgg gcgcagggtg atggcagtac gctgcaggtc 480 
tacgatactc cattgggaaa actcggtggc ttaatttgct gggaaaatta tatgccgctg 540 
gcacggtatg ctatgtatgc ctggggaaca caaatctatg tcgcggcaac gtgggatcgc 600 
ggtcaaccct ggctttctac attaaggcat atcgccaaag aaggcagggt atacgtgatt 660 
ggttgctgta tcgcgatgcg taaagacgat attccagatc gttacaccat gaagcaaaaa 720 
tattatgctg aaatggatga atggatgaat gttggtgaca gtgtgattgt caatcccgag 780 
gggcacttta ttgccgggcc tgtgcgcaag caggaagaaa ttctctacgc ggagattgat 840 
cctcgcatgg tgcaaggccc gaagtggatg ctcgatgtgg cagggcatta tgcgagaccg 
gatgtgttcc agttgacggt gcatacggat gtgaggcgga tgatgcgggt ggaagatgat 
tcataa 



<210> 96 
<211> 321 
<212> PRT 
<213> Unknown 

<220> * 

<223> Obtained from an environmental sample 



<400> 96 
















Met 


Ser Asn 


Glu 


Asn Asn 


He 


Ala 


Thr Phe Lys Val Ala Ala Val 


Gin 


1 






5 






10 


15 




Ala 


Thr Pro 


Val 


Phe Leu 


Asp 


Arg 


Glu Ala Thr He 


Asp Lys Ala 


Cys 






20 








25 


30 




Ala 


Leu lie 


Ala 


Thr Ala 


Gly 


Ser 


Glu Gly Ala Arg 


Leu He Val 


Phe 




35 








40 




45 




Pro 


Glu Ala 


Phe 


He Pro 


Thr 


Tyr 


Pro Glu Trp Val 


Trp Gly He 


Pro 




50 






55 




60 






Ser 


Gly Glu 


Gin 


Gly Leu 


Leu 


Asn 


Glu Leu Tyr Ala 


Glu Leu Leu 


Thr 


65 




70 






75 




80 


Asn 


Ala Val 


Thr 


He Pro 


Ser 


Asp 


Ala Thr Asp Arg 


Leu Cys Glu 


Ala 








85 






90 


95 




Ala 


Gin Leu 


Ala 


Asn Ala 


Tyr 


Val 


Val Met Gly Met 


Ser Glu Arg Asn 






100 








105 


110 




Val 


Glu Ala 


Ser 


Gly Ala 


Ser 


Leu 


Tyr Asn Thr Leu 


Leu Tyr He 


Asn 




115 






120 




125 




Ala 


Gin Gly 


Glu 


He Leu 


Gly 


Lys 


His Arg Lys Leu 


Val Pro Thr 


Gly 




130 






135 




140 






Gly 


Glu Arg 


Leu 


Val Trp 


Ala 


Gin 


Gly Asp Gly Ser 


Thr Leu Gin 


Val 


145 




150 






155 




160 


Tyr 


Asp Thr 


Pro 


Leu Gly 


Lys 


Leu 


Gly Gly Leu He 


Cys Trp Glu 


Asn 



60 
120 



900 
960 
966 
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165 1™ 175 

Tyr Met Pro Leu Ala Arg Tyr Ala Met Tyr Ala Trp Gly Thr Gin He 

180 185 190 

Tvr Val Ala Ala Thr Trp Asp Arg Gly Gin Pro Trp Leu Ser Thr Leu 

* 195 200 205 

Arg His lie Ala Lys Glu Gly Arg Val Tyr Val He Gly Cys Cys lie 

210 215 220 

Ala Met Arg Lys Asp Asp He Pro Asp Arg Tyr Thr Met Lys Gin Lys 
225 230 235 240 

Tvr Tyr Ala Glu Met Asp Glu Trp Met Asn Val Gly Asp Ser Val He 

245 250 255 

Val Asn Pro Glu Gly His Phe He Ala Gly Pro Val Arg Lys Gin Glu 

260 265 270 

Glu He Leu Tyr Ala Glu He Asp Pro Arg Met Val Gin Gly Pro Lys 

275 280 285 

Trp Met Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe Gin 

290 295 300 

Leu Thr Val His Thr Asp Val Arg Arg Met Met Arg Val Glu Asp Asp 
305 310 315 320 

Ser 



<210> 97 
<211> 1017 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



60 
120 
180 
240 



atgggcatcg aacatccgaa atacaaggtc gccgtggtgc aagctgcgcc cgcctggctc 
gacctcgacg cgtcgatcga caagacgatc gggctgatcg aggaggcggc 9*agaaaggc 
gccaagctga tcgctttccc cgaagccttc attcccggct acccttggca 'atctggctc 
gactcacccg cctgggcgat cggccgcggt ttcgtgcagc gctatttcga caattcgctc 
gcctacgaca gcccacaggc ggaaaggctg cgacaggccg tgcggaaggc cugctcacc 300 
gccgtgatcg gcctgtccga gcgcgacggc ggcagcctct atctcgcgca ^ggctgatc 360 
gggcccgacg gtgagaccat cgcaaagcgc cgcaagctgc ggccgaccca tgccgagcgc 420 
accgtcLt? gcgaaggcga cggcagcgat ctcgccgtcc atgagcgggc cgacatcggc 480 
cggctcggcg cgctgtgctg ctgggagcat ctgcagccgc tgtcgaaatt cgccatgtac 540 
gcccagaacg agcaggtaca tgtcgcggcc tggccgagct tctcgctcta cgatcccttc 600 
gcgcc?gcgc tgggcgcgga ggtgaacaac gccgcctccc gcatctatgc 99tggaaggc 
tcctgcttcg tgcicgcacc gtgcgcgacg gtctcgcagg ccatgatcga cgagctctgc 
gatcggccgg acaagcacgc gctgctgcat gccggcggcg gcttcgccgc ^tctacggg 
cccgacggca gccagatcgg cgacaagctg ccgcccgagc aggagggcct gctgatcgcc 840 
gagatcgatc tgggcgcgat cggcgtcgcc aagaacgcgg ccgatcccgc cgggcattat 900 
?cgcggcccg acgtcacgcg gctcctgctc aacaggaagc cgaacaagcg cgtggagcag 960 
ttcgcgctgc ccgtcgacac ggtcgagccc gtcgacgtcg cggcggcagc aagctga 

<210> 98 
<211> 338 
<212> PRT 
<213> Unknown 

<220> , . 

<223> Obtained from an environmental sample 

Me?°Gly 8 Ile Glu His Pro Lys Tyr Lys Val Ala Val Val Gin Ala Ala 

1 5 1° 15 

Pro Ala Trp Leu Asp Leu Asp Ala Ser lie Asp Lys Thr He Gly Leu 



480 
540 
600 
660 
720 
780 



1017 
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20 25 30 

lie Glu Glu Ala Ala Lys Lys Gly Ala Lys Leu lie Ala Phe Pro Glu 

35 40 45 

Ala Phe lie Pro Gly Tyr Pro Trp His lie Trp Leu Asp Ser Pro Ala 

50 55 60 

Trp Ala lie Gly Arg Gly Phe Val Gin Arg Tyr Phe Asp Asn Ser Leu 
65 70 75 80 

Ala Tyr Asp Ser Pro Gin Ala Glu Arg Leu Arg Gin Ala Val Arg Lys 

85 90 95 

Ala Lys Leu Thr Ala Val He Gly Leu Ser Glu Arg Asp Gly Gly Ser 

100 105 110 

Leu Tyr Leu Ala Gin Trp Leu lie Gly Pro Asp Gly Glu Thr He Ala 

115 120 125 

Lys Arg Arg Lys Leu Arg Pro Thr His Ala Glu Arg Thr Val Tyr Gly 

130 135 140 

Glu Gly Asp Gly Ser Asp Leu Ala Val His Glu Arg Ala Asp He Gly 
145 150 155 160 

Arg Leu Gly Ala Leu Cys Cys Trp Glu His Leu Gin Pro Leu Ser Lys 

165 170 175 

Phe Ala Met Tyr Ala Gin Asn Glu Gin Val His Val Ala Ala Trp Pro 

180 185 190 

Ser Phe Ser Leu Tyr Asp Pro Phe Ala Pro Ala Leu Gly Ala Glu Val 

195 200 205 

Asn Asn Ala Ala Ser Arg He Tyr Ala Val Glu Gly Ser Cys Phe Val 

210 215 220 

Leu Ala Pro Cys Ala Thr Val Ser Gin Ala Met He Asp Glu Leu Cys 
225 230 235 240 

Asp Arg Pro Asp Lys His Ala Leu Leu His Ala Gly Gly Gly Phe Ala 

245 250 255 

Ala He Tyr Gly Pro Asp Gly Ser Gin He Gly Asp Lys Leu Pro Pro 

260 265 270 

Glu Gin Glu Gly Leu Leu He Ala Glu He Asp Leu Gly Ala He Gly 

275 280 285 

Val Ala Lys Asn Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp 

290 295 300 

Val Thr Arg Leu Leu Leu Asn Arg Lys Pro Asn Lys Arg Val Glu Gin 
305 310 315 320 

Phe Ala Leu Pro Val Asp Thr Val Glu Pro Val Asp Val Ala Ala Ala 
325 330 335 

Ala Ser 



<210> 99 
<211> 1014 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 99 

atgcctgaca agagaatcgt ccgcgccgcc gcggtccaga tagcaccgga cctcgaacgg 60 

cccggtggca cgctcgagaa ggtcctcgag acgatcgacg acgccgcacg ccagggcgtg 120 

cagctcatcg tcttccccga gaccttcctg ccctactacc cgtacttttc gttcgtgcgg 180 

gcgccggtgg catcgggtgc agagcacatg cggctctatg acgaagcggt ggtcgtgccc 240 

gggccggtga cgcatgcggt ggccgagcgg gcacggcggc acggcatggt cgtcgtgctc 300 

ggcgtgaacg agcgcgatca cggcagctta tacaacgcac aactgatctt cgataccgac 360 

ggcgagctgc tgctcaagcg ccgcaagatc acgccgacgt ttcacgaacg gatgatctgg 420 

ggcatgggcg acgcagccgg cctgaaggta gcggaaacgc gtatcggccg ggtgggtgca 4 80 

ctcgcttgct gggaacacta caacccgctt gcacgttatg cactgatgac ccagcacgaa 540 

gagattcatt gcagccagtt tcccggctcg ctggtcggac ccatcttcgg tgaacagatc 600 
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gaagtgacca tccggcatca cgcactggaa tccggctgct tcgtgatcaa ttccaccggc 660 

tggctgaccg agccgcagat cgagtcgatc acgaaagatc cgggcctgca gaaggcgctt 720 

cgcggcggct gcaacacggc gatcatctcg cccgaaggcc agcatctcgc cccgccgctg 780 

cgtgagggcg agggcatggt catcgctgac ctggacatgt cgctgatcac caaacgcaaa 840 

cgcatgatgg attctgtcgg ccactacgcg cggcccgaac tgctgagcct cgccatcaac 900 

gaccggccgg cggtcacgtc ggcacccatg aacagcttct catcttcaac cgggggattg 960 

caccttgaac gcgaacgaga ccttgtcggc cgtgagccgg caattgatga ctga 1014 

<210> 100 
<211> 337 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 100 

Met Pro Asp Lys Arg lie Val Arg Ala Ala Ala Val Gin lie Ala Pro 

15 10 15 

Asp Leu Glu Arg Pro Gly Gly Thr Leu Glu Lys Val Leu Glu Thr He 

20 25 30 

Asp Asp Ala Ala Arg Gin Gly Val Gin Leu He Val Phe Pro Glu Thr 

35 40 45 

Phe Leu Pro Tyr Tyr Pro Tyr Phe Ser Phe Val Arg Ala Pro Val Ala 

50 55 60 

Ser Gly Ala Glu His Met Arg Leu Tyr Asp Glu Ala Val Val Val Pro 
65 70 75 80 

Gly Pro Val Thr His Ala Val Ala Glu Arg Ala Arg Arg His Gly Met 

85 90 95 

Val Val Val Leu Gly Val Asn Glu Arg Asp His Gly Ser Leu Tyr Asn 

100 105 HO 

Ala Gin Leu He Phe Asp Thr Asp Gly Glu Leu Leu Leu Lys Arg Arg 

115 120 125 

Lys He Thr Pro Thr Phe His Glu Arg Met He Trp Gly Met Gly Asp 

130 135 140 

Ala Ala Gly Leu Lys Val Ala Glu Thr Arg He Gly Arg Val Gly Ala 
145 150 155 160 

Leu Ala Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Leu Met 

165 170 175 

Thr Gin His Glu Glu He His Cys Ser Gin Phe Pro Gly Ser Leu Val 

180 185 190 

Gly Pro He Phe Gly Glu Gin He Glu Val Thr He Arg His His Ala 

195 200 205 

Leu Glu Ser Gly Cys Phe Val He Asn Ser Thr Gly Trp Leu Thr Glu 

210 215 220 

Pro Gin He Glu Ser He Thr Lys Asp Pro Gly Leu Gin Lys Ala Leu 
225 230 235 240 

Arg Gly Gly Cys Asn Thr Ala He He Ser Pro Glu Gly Gin His Leu 

245 250 255 . 

Ala Pro Pro Leu Arg Glu Gly Glu Gly Met Val He Ala Asp Leu Asp 

260 265 270 

Met Ser Leu He Thr Lys Arg Lys Arg Met Met Asp Ser Val Gly His 

275 280 285 

Tyr Ala Arg Pro Glu Leu Leu Ser Leu Ala lie Asn Asp Arg Pro Ala 

290 295 300 

Val Thr Ser Ala Pro Met Asn Ser Phe Ser Ser Ser Thr Gly Gly Leu 
305 310 315 320 

His Leu Glu Arg Glu Arg Asp Leu Val Gly Arg Glu Pro Ala He Asp 
325 330 335 

Asp 
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<210> 101 
<211> 1065 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 

<400> 101 cn 

atggcgacag tccatccgaa atttaaagta gccgccgtcc aggcggcccc ggcctttctc 60 

gacctcgacg cgtcggtgga aaaagcggtg cgcctgattg atgaagccgg cgccgctggt 120 

gcccggctca tcgcgtttcc agagactttt atccccggtt atccgtggtg gatctggctc 180 

ggtgctccgg cctgggcgat catgcgcggc ttcgtctccc gctatttcga caactcgctg 240 

cagtacggca ccccggaagc cgaccggctg cgggcagccg ccaaacgcaa caaaatgttc 300 

gtcgcgctcg gactgtcaga gcgcgacggc ggcagtctct acatcgccca atggattatc 360 

ggacccgacg gcgagacggt cgcaacgcgc cgcaagctca agcctactca cgccgagcgg 420 

acggtgttcg gcgaaggcga tggctcgcac cttgcggtcc acgaacttga tatcgggcgg 480 

gtcggtgcgc tgtgctgttg ggagcacctg cagccactgt cgaagtacgc gatgtatgcg 540 

cagaacgagc aagttcatat cgcggcgtgg ccgagctttt cgctttacga tccgttcgcg 600 

catgcgcttg gcgccgaggt caacaacgcg gcgagcaaga tctacgcggt cgaaggctca 660 

tgctttgtga ttgcgccatg cgcgaccgtt tcccaggcga tgatcgacga attgtgtgac 720 

tcgcccgaga agcatcagtt cctgcacgtc ggcggcggtt tcgccgtgat ctatggtccc 780 

gacggcgcgc cactcgccaa gccactggcg cccgatcagg agggtctcct ttacgcggat 840 

atcgacctcg gcatgatttc ggtcgcgaaa gcggcggccg atccggctgg acattacgcg 900 

cgcccggacg tgacccgtct gttgttcaac aatcgtcctg ggaaccgggt ggagacactc 960 

gcgctgccgg tcgaccagga ggcagaggcg ggagcaggcg gcaaacctgc gcccaagtca 1020 

ccgagtgtcg ctgcgttcac actgacgcag gcggcagccg agtag 1065 

<210> 102 
<211> 354 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 102 

Met Ala Thr Val His Pro Lys Phe Lys Val Ala Ala Val Gin Ala Ala 

15 10 15 

Pro Ala Phe Leu Asp Leu Asp Ala Ser Val Glu Lys Ala Val Arg Leu 

20 25 30 

lie Asp Glu Ala Gly Ala Ala Gly Ala Arg Leu He Ala Phe Pro Glu 

35 40 45 

Thr Phe He Pro Gly Tyr Pro Trp Trp He Trp Leu Gly Ala Pro Ala 

50 55 60 

Trp Ala He Met Arg Gly Phe Val Ser Arg Tyr Phe Asp Asn Ser Leu 
65 70 75 80 

Gin Tyr Gly Thr Pro Glu Ala Asp Arg Leu Arg Ala Ala Ala Lys Arg 

85 90 95 

Asn Lys Met Phe Val Ala Leu Gly Leu Ser Glu Arg Asp Gly Gly Ser 

100 105 HO 

Leu Tyr He Ala Gin Trp He He Gly Pro Asp Gly Glu Thr Val Ala 

115 120 125 

Thr Arg Arg Lys Leu Lys Pro Thr His Ala Glu Arg Thr Val Phe Gly 

130 135 140 

Glu Gly Asp Gly Ser His Leu Ala Val His Glu Leu Asp He Gly Arg 
145 150 155 160 

Val Gly Ala Leu Cys Cys Trp Glu His Leu Gin Pro Leu Ser Lys Tyr 

165 I™ 175 

Ala Met Tyr Ala Gin Asn Glu Gin Val His He Ala Ala Trp Pro Ser 
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180 








Phe 


Ser 


Leu 
195 


Tyr Asp 


Pro 


Phe 


Ala 
200 


Asn 


Ala 
210 


Ala 


Ser Lys 


He 


Tyr 
215 


Ala 


Ala 


Pro 


Cys 


Ala Thr 


Val 


Ser 


Gin 


225 






230 






Ser 


Pro 


Glu 


Lys His 
245 


Gin 


Phe 


Leu 


He 


Tyr 


Gly 


Pro Asp Gly 


Ala 


Pro 








260 








Gin 


Glu 


Gly 


Leu Leu 


Tyr 


Ala Asp 






275 








280 


Ala 


Lys 


Ala 


Ala Ala Asp 


Pro 


Ala 




290 








295 




Thr Arg 


Leu 


Leu Phe 


Asn 


Asn Arg 


305 








310 






Ala 


Leu 


Pro 


Val Asp 
325 


Gin 


Glu 


Ala 


Ala 


Pro 


Lys 


Ser Pro 
340 


Ser 


Val 


Ala 



Ala Glu 



185 






190 




His 


Ala 


Leu Gly 


Ala Glu Val 


Asn 








205 




Val 


Glu 


Gly Ser 


Cys Phe Val 


He 






220 






Ala 


Met 


He Asp 


Glu Leu Cys 


Asp 






235 




240 


His 


Val 


Gly Gly 


Gly Phe Ala 


Val 




250 




255 




Leu 


Ala 


Lys Pro 


Leu Ala Pro 


Asp 


265 






270 




He 


Asp 


Leu Gly 


Met He Ser 


Val 








285 




Gly 


His 


Tyr Ala 


Arg Pro Asp 


Val 






300 






Pro 


Gly 


Asn Arg 


Val Glu Thr 


Leu 






315 




320 


Glu 


Ala 


Gly Ala 


Gly Gly Lys 


Pro 




330 




335 




Ala 


Phe 


Thr Leu 


Thr Gin Ala 


Ala 


345 






350 





<210> 103 
<211> 945 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 103 

atgggcgagt tcggcgaggt gacgctgggg gtggcgcagg cggcgccggt gtacttcgac 60 

cgggaggcgt cgacggagaa ggctcgcggc ctgatccggg aggcggggga gaagggcgtc 120 

gacctgttgg cgttcgggga gacgtggctg acggggtacc cgtactggaa ggatgcgccg 180 

tggtctcggg agtacaacga cctgcgcgcg cggtacgtgg cgaatggcgt gatgataccg 240 

gggccggaga cggacgcgct atgccaggca gcggcggaag cgggggtgga cgtggcaatc 300 

ggcgtggtgg agctggagcc ggggagcctt tcgagcgtgt attgcacgtt gctgttcatc 360 

tcgcgcgagg gcgagatcct ggggcggcac cggaagctga agccgacgga ttcggaacgg 420 

cggtactggt cagagggtga tgcgacgggg ctgcgggtgt acgagcggcc atatggccgg 480 

ttgagcggat tgaactgctg ggaacacctt atgatgttgc cggggtacgc gctggcggca 540 

caggggacgc agtttcatgt ggcagcgtgg ccgaacatgg cgagctcggc gagcgagctg 600 

ctgtcgcggg cgtatgcgta ccaggccgga tgctacgtgt tgtgcgcggg cgggctcggg 660 

cctgcgccgg gagagctacc ggacggcatc gcggcggagt cgctggacca cctgacgggc 720 

gagagctgca tcatcgaccc gtggggaaaa gtgatcgcgg ggccggtgtc gtgcgaggag 780 

acgctgatta cggcgcgggt atcgaccgcg tcaatctacc ggcgcaagtc gctgacggac 840 

gtgggtggcc actactcgcg accggacgtg ttccggttcg aggtggatag gtcggagcgc 900 

ccgcgagtgg tgtttcggga tggggatgtg gacgaccggg ggtaa 945 



<210> 104 
<211> 314 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 104 

Met Gly Glu Phe Gly Glu Val Thr Leu Gly Val Ala Gin Ala Ala Pro 
15 10 15 
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Val Tyr Phe Asp Arg Glu Ala Ser Thr Glu Lys Ala Arg Gly Leu He 

20 25 30 

Arg Glu Ala Gly Glu Lys Gly Val Asp Leu Leu Ala Phe Gly Glu Thr 

35 40 45 

Trp Leu Thr Gly Tyr Pro Tyr Trp Lys Asp Ala Pro Trp Ser Arg Glu 

50 55 60 

Tyr Asn Asp Leu Arg Ala Arg Tyr Val Ala Asn Gly Val Met He Pro 
. 65 7 0 75 80 

Glv Pro Glu Thr Asp Ala Leu Cys Gin Ala Ala Ala Glu Ala Gly Val 

85 90 95 

Asp Val Ala He Gly Val Val Glu Leu Glu Pro Gly Ser Leu Ser Ser 

100 105 HO 

Val Tvr Cys Thr Leu Leu Phe He Ser Arg Glu Gly Glu He Leu Gly 

115 120 125 

Arg His Arg Lys Leu Lys Pro Thr Asp Ser Glu Arg Arg Tyr Trp Ser 

130 135 140 

Glu Gly Asp Ala Thr Gly Leu Arg Val Tyr Glu Arg Pro Tyr Gly Arg 
145 150 155 160 

Leu Ser Gly Leu Asn Cys Trp Glu His Leu Met Met Leu Pro Gly Tyr 

165 170 175 

Ala Leu Ala Ala Gin Gly Thr Gin Phe His Val Ala Ala Trp Pro Asn 

180 185 190 

Met Ala Ser Ser Ala Ser Glu Leu Leu Ser Arg Ala Tyr Ala Tyr Gin 

195 200 205 

Ala Gly Cys Tyr Val Leu Cys Ala Gly Gly Leu Gly Pro Ala Pro Gly 

210 215 220 

Glu Leu Pro Asp Gly He Ala Ala Glu Ser Leu Asp His Leu Thr Gly 
225 230 235 240 

Glu Ser Cys He He Asp Pro Trp Gly Lys Val He Ala Gly Pro Val 

245 250 255 

Ser Cys Glu Glu Thr Leu He Thr Ala Arg Val Ser Thr Ala Ser He 

260 265 270 

Tyr Arg Arg Lys Ser Leu Thr Asp Val Gly Gly His Tyr Ser Arg Pro 

275 280 285 

Asp Val Phe Arg Phe Glu Val Asp Arg Ser Glu Arg Pro Arg Val Val 

290 295 300 

Phe Arg Asp Gly Asp Val Asp Asp Arg Gly 
305 310 

<210> 105 
<211> 975 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 

atgaccattg tcaaagccgc tgccgtccag attgcccccg ttctctacag ccgtgaaggc 60 
actgtagaaa aggtcgttaa caagattcgc gaactcggcg agaagggcgt gcagttcgcc 120 
gttttccctg aaaccgtcgt accgtactac ccgtactttt cctttgtgca gagccctttc 180 
aaaatgggtt ccgagcacta caaattgctc gaccaggccg ttgtcgtgcc gtcggcgacc 
accgatgcca tcggcaaagc ggccaaggaa gccaacatgg tggtgtccat cggcgtcaac 
gaacgcgatg gcagcaccct ctacaacacg cagttgctgt ttgatgccga cggcactttg 
attcaggccc gtcgcaagat ttcaccgacc taccacgaac gcatgatctg gggcatgggc 
gacggttccg gcctgcgcgc caccgacagc gcggtcgggc gcatcggaca attggcctgc 
tgggaacatt acaatccgct ggcgcgttac gccttgatcg aagacggcga acagatccac 
gcctcgatgt acccgggctc gttcgcaggt cctttattca ctcgccagat ggaagtcagc 
atccgcatgc atgccctgga atcggcgtgc ttcgtggtca actcgaccgc gtggttgtac 
ccggaacagc aagcccagat catggccgac accggttgcg agatcgggcc gatctccggc 
ggctgctaca ccgcgatcat cgacccacag ggtgaagtcg tcggcgcact gaccgaaggc 



240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
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gagggcgaag tgattgccga catcgatctg ttccagatcg aaatccgtaa acgtcagatg 840 

gacggccgtg gtcactacag ccgtccggaa atcctgagcc tgaacatcga ccgtacgccg 900 

catcgccatg ttcacgaacg caacgaccag cagaaaccgg gtgtgatcga cactgctgaa 960 
gaaaccgggc gttga 

<210> 106 
<211> 324 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 106 

Met Thr lie Val Lys Ala Ala Ala Val Gin lie Ala Pro Val Leu Tyr 

15 10 15 

Ser Arg Glu Gly Thr Val Glu Lys Val Val Asn Lys lie Arg Glu Leu 

20 25 30 

Gly Glu Lys Gly Val Gin Phe Ala Val Phe Pro Glu Thr Val Val Pro 

35 40 45 

Tyr Tyr Pro Tyr Phe Ser Phe Val Gin Ser Pro Phe Lys Met Gly Ser 

50 55 60 

Glu His Tyr Lys Leu Leu Asp Gin Ala Val Val Val Pro Ser Ala Thr 
65 70 75 80 

Thr Asp Ala lie Gly Lys Ala Ala Lys Glu Ala Asn Met Val Val Ser 

85 90 95 

He Gly Val Asn Glu Arg Asp Gly Ser Thr Leu Tyr Asn Thr Gin Leu 

100 105 HO 

Leu Phe Asp Ala Asp Gly Thr Leu He Gin Ala Arg Arg Lys He Ser 

115 120 125 

Pro Thr Tyr His Glu Arg Met He Trp Gly Met Gly Asp Gly Ser Gly 

130 135 140 

Leu Arg Ala Thr Asp Ser Ala Val Gly Arg He Gly Gin Leu Ala Cys 
145 150 155 160 

Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Leu He Glu Asp Gly 

165 170 175 

Glu Gin He His Ala Ser Met Tyr Pro Gly Ser Phe Ala Gly Pro Leu 

180 185 190 

Phe Thr Arg Gin Met Glu Val Ser He Arg Met His Ala Leu Glu Ser 

195 200 205 

Ala Cys Phe Val Val Asn Ser Thr Ala Trp Leu Tyr Pro Glu Gin Gin 

210 215 220 

Ala Gin He Met Ala Asp Thr Gly Cys Glu He Gly Pro He Ser Gly 
225 230 235 240 

Glv Cys Tyr Thr Ala He He Asp Pro Gin Gly Glu Val Val Gly Ala 

245 250 255 

Leu Thr Glu Gly Glu Gly Glu Val He Ala Asp He Asp Leu Phe Gin 

260 265 270 

He Glu He Arg Lys Arg Gin Met Asp Gly Arg Gly His Tyr Ser Arg 

275 280 285 

Pro Glu He Leu Ser Leu Asn He Asp Arg Thr Pro His Arg His Val 

290 295 300 

His Glu Arg Asn Asp Gin Gin Lys Pro Gly Val He Asp Thr Ala Glu 
305 310 315 320 

Glu Thr Gly Arg 



<210> 107 
<211> 981 
<212> DNA 
<213> Unknown 
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<220> 

<223> Obtained from an environmental sample 
<400> 107 

atggccatca ttcgcgcagc agccgtacag atcagcccgg ttctttacag ccgcgaaggc 60 

accgtggaca aggtctgcca gcagatcatc acccttggca aacagggtgt gcagttcgcc 120 

gtgttcccgg aaacggtggt gccgtactac ccctattttt cctttgtgca gccggcgttc 180 

gccatgggtg cgcaacacct caaattgcta gatcaatctg taaccgtgcc atcggccgcc 240 

accctggcga ttggcgaagc gtgcaagcaa gcaggaatgg tcgtttccat cggagtcaat 300 

gaacgcgatg gcggtacgat ttacaacgcg caattactct tcgatgctga cggcacgctg 360 

attcagcatc ggcgcaaaat caccccgacc taccacgagc gcatggtctg ggggcagggc 420 

gatggttccg gtctgcgcgc catcgacagc gcggtcgggc gcatcggctc cctggcatgc 480 

tgggaacatt acaacccgct ggcccgttac gccttgatgg cagacggcga acagatccac 540 

gccgcgatgt ttcccggttc cctggtgggt gacatcttcg ccgagcagat cgaggtcacc 600 

atccgccatc acgcattgga gtcaggatgc ttcgtggtca atgcaacagc ctggctggat 660 

gcggatcagc agggccaaat aatgcaggac acaggttgcg gccttggtcc catctcgggc 720 

ggctgcttca ccgcgatcgt atcgccggaa gggaagctac ttggagagcc gcttcgctcc 780 

ggggaaggcg tagtgattgc cgacctcgat acggccttga tcgacaagcg caaacggatg 840 

atggattcag taggtcatta cagtcgtccc gagctgctca gcctattgat cgatcgatcg 900 

ccgactgcgc atgttcatga acgcgccggc tttgtttcga gcaacgccgg tttgcaggag 960 

gtcgcccatg cagaccaatg a 981 

<210> 108 
<211> 326 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 108 

Met Ala lie lie Arg Ala Ala Ala Val Gin He Ser Pro Val Leu Tyr 

15 10 15 

Ser Arg Glu Gly Thr Val Asp Lys Val Cys Gin Gin He He Thr Leu 

20 25 30 

Gly Lys Gin Gly Val Gin Phe Ala Val Phe Pro Glu Thr Val Val Pro 

35 40 45 

Tyr Tyr Pro Tyr Phe Ser Phe Val Gin Pro Ala Phe Ala Met Gly Ala 

50 55 60 

Gin His Leu Lys Leu Leu Asp Gin Ser Val Thr Val Pro Ser Ala Ala 
65 70 75 80 

Thr Leu Ala He Gly Glu Ala Cys Lys Gin Ala Gly Met Val Val Ser 

85 90 95 

He Gly Val Asn Glu Arg Asp Gly Gly Thr He Tyr Asn Ala Gin Leu 

100 105 HO 

Leu Phe Asp Ala Asp Gly Thr Leu He Gin His Arg Arg Lys He Thr 

115 120 125 . 

Pro Thr Tyr His Glu Arg Met Val Trp Gly Gin Gly Asp Gly Ser Gly 

130 135 140 

Leu Arg Ala He Asp Ser Ala Val Gly Arg He Gly Ser Leu Ala Cys 
145 150 155 160 

Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Leu Met Ala Asp Gly 

165 170 175 

Glu Gin He His Ala Ala Met Phe Pro Gly Ser Leu Val Gly Asp He 

180 185 190 

Phe Ala Glu Gin He Glu Val Thr He Arg His His Ala Leu Glu Ser 

195 200 205 

Gly Cys Phe Val Val Asn Ala Thr Ala Trp Leu Asp Ala Asp Gin Gin 

'210 215 220 

Gly Gin lie Met Gin Asp Thr Gly Cys Gly Leu Gly Pro He Ser Gly 
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225 








£. JU 






235 




^lU 


Gly Cys 


fne 


xnr Axa 


T1a 

11c 


vai 


Ser 


Pro Glu Gly Lys 


Leu Leu Gly 


UlU 








245 








250 


255 




Pro 


Leu 


Arg 


Ser Gly 


Glu 


Gly 


Val 


Val He Ala Asp 


Leu Asp Thr 


Ala 








260 








265 


270 




Leu 


lie 


Asp 


Lys Arg 


Lys 


Arg 


Met 


Met Asp Ser Val 


Gly His Tyr 


Ser 






275 








280 




285 




Arg 


Pro 


Glu 


Leu Leu 


Ser 


Leu 


Leu 


He Asp Arg Ser 


Pro Thr Ala 


His 




290 








295 




300 






Val 


His 


Glu 


Arg Ala 


Gly 


Phe 


Val Ser Ser Asn Ala Gly Leu Gin 


Glu 


305 








310 






315 




320 


Val 


Ala 


His 


Ala Asp 


Gin 


















325 















<210> 109 
<211> 1092 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 109 

atggccatca ttcgcgcagc agccgtacag atcagcccgg ttctttacag ccgcgaaggc 60 

accgtggaca gggtctgcca gcagatcatc acccttggca aacaaggtgt gcagttcgcc 120 

gtgttcccgg aaacggtggt gccgtactac ccctattttt cctttgtgca gccggcattt 180 

gcgatgggtg cacaacacct caaattgctc gatcaatctg taaccgtgcc atcggccgcc 240 

accctggcga ttggcgaagc gtgcaagcaa gcaggaatgg tcgtttccat cggcgtcaat 300 

gaacgcgatg gcggtacgat ttacaacgcg caattactct tcgatgctga cggcactctg 360 

attcagcatc ggcgcaaaat caccccgacc taccacgagc gcatggtctg ggggcagggc 420 

gatggttccg gtctgcgcgc catcgacagc gcggtcgggc gcatcggctc cctggcatgc 480 

tgggaacatt acaacccgct ggcccgttac gccttgatgg cagacggcga acagatccac 540 

gccgcgatgt ttcccggttc cctggtgggt gacatcttcg ccgagcagat cgaggtcacc 600 

atccgccatc acgcattgga atcaggatgc ttcgtggtca atgcaacagc ttggctggat 660 

gcggatcagc agggccaaat aatgcaggac acaggttgcg gccttggtcc catctcgggc 720 

ggctgcttca ccgcgatcgt atcgccggaa gggaagctac ttggagagcc gcttcgctca 780 

ggggaaggcg tagtgattgc cgacctcgat atggccttga tcgacaagcg caaacggatg 840 

atggattcag taggtcatta cagtcgtccc gagctgctca gcctattgat cgatcgatcg 900 

ccgactgcgc attttcatga acgcgccggg ctttgttccg agcgacgccg gtttgcagga 960 

ggtcgcgcat gcagaccaat gaattgctcg ctgacctgca aatccaaggc ctgcgttggc 1020 

cggccgcgca aatggcttgt cgcgccaagg cggcgccggt ccttcagacc acaaggcgct 1080 

gagcctaggt aa 1092 

<210> 110 
<211> 363 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 110 

Met Ala He He Arg Ala Ala Ala Val Gin He Ser Pro Val Leu Tyr 

15 10 15 

Ser Arg Glu Gly Thr Val Asp Arg Val Cys Gin Gin He He Thr Leu 

20 25 30 

Gly Lys Gin Gly Val Gin Phe Ala Val Phe Pro Glu Thr Val Val Pro 

35 40 45 

Tyr Tyr Pro Tyr Phe Ser Phe Val Gin Pro Ala Phe Ala Met Gly Ala 

50 55 60 

Gin His Leu Lys Leu Leu Asp Gin Ser Val Thr Val Pro Ser Ala Ala 
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65 








70 


75 




o n 
oU 


Thr 


Leu 


Ala 


He Gly Glu 


Ala Cys Lys Gin Ala Gly Met Val 


Val 


Ser 








85 




90 


95 




lie 


Gly 


Val 


Asn Glu 
100 


Arg 


Asp Gly Gly Thr He Tyr Asn Ala 
105 110 


Gin 


Leu 


Leu 


Phe 


Asp 


Ala Asp Gly 


Thr Leu He Gin His Arg Arg Lys 


He 


Thr 






115 






120 125 






Pro 


Thr 


Tyr 


His Glu 


Arg 


Met Val Trp Gly Gin Gly Asp Gly Ser 


Gly 




130 






135 140 






Leu 


Arg 


Ala 


He Asp 


Ser 


Ala Val Gly Arg lie Gly Ser Leu 


Ala 


Cys 


145 






150 


155 




160 


Trp 


Glu 


His 


Tyr Asn 


Pro 


Leu Ala Arg Tyr Ala Leu Met Ala 


Asp 


Gly 






165 




170 


175 




Glu 


Gin 


He 


His Ala 


Ala 


Met Phe Pro Gly Ser Leu Val Gly Asp 


lie 








180 




185 190 






Phe 


Ala 


Glu 
195 


Gin He 


Glu 


Val Thr lie Arg His His Ala Leu 
200 205 


Glu 


Ser 


Gly Cys 


Phe 


Val Val 


Asn 


Ala Thr Ala Trp Leu Asp Ala Asp 


Gin 


Gin 




210 








215 220 






Gly Gin 


He 


Met Gin 


Asp 


Thr Gly Cys Gly Leu Gly Pro He 


Ser 


Gly 


225 








230 


235 




240 


Gly Cys 


Phe 


Thr Ala 


He 


Val Ser Pro Glu Gly Lys Leu Leu 


Gly 


Glu 








245 




250 


255 




Pro 


Leu Arg 


Ser Gly Glu 


Gly Val Val He Ala Asp Leu Asp Met 


Ala 








260 




265 270 






Leu 


lie 


Asp 


Lys Arg 


Lys 


Arg Met Met Asp Ser Val Gly His Tyr 


Ser 






275 






280 285 






Arg 


Pro 
290 


Glu 


Leu Leu 


Ser 


Leu Leu He Asp Arg Ser Pro Thr 
295 300 


Ala 


His 


Phe 


His 


Glu 


Arg Ala 


Gly 


Leu Cys Ser Glu Arg Arg Arg Phe 


Ala 


Gly 


305 








310 


315 




320 


Gly Arg Ala 


Cys Arg 


Pro 


Met Asn Cys Ser Leu Thr Cys Lys 


Ser 


Lys 








325 




330 


335 




Ala 


Cys 


Val 


Gly Arg 
340 


Pro 


Arg Lys Trp Leu Val Ala Pro Arg 
345 350 


Arg 


Arg 


Arg 


Ser 


Phe 
355 


Arg Pro 


Gin 


Gly Ala Glu Pro Arg 
360 







<210> 111 
<211> 990 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 
<400> 111 

atgcccaaaa cagtacgtgc cgcagcagtc cagatcgcgc ccgacctgac gtcacgcgcc 60 

ggcaccgtcg agcgggtcct caatgcaatc gccgaagctg ctgacaaagg cgccgagctg 120 

atcgtatttc ccgagacctt cgtgccctgg tatccctatt tcagtttcgt tctgccacct 180 

gtccagcaag gccctgagca tcttcgtctt tatgaggaag cagtcacggt accatcagca 240 

gaaacacggg ccgtcgcgga cgccgcgcgc aaacgcaatg cggttatcgt ccttggcgtc 300 

aatgagcgcg accacggctc gctctataac actcagctga tcttcgacgc ggatggcagc 360 

ctgaaactca agcgtcgcaa gatcacgccg acctatcacg aacggatgat ctggggccaa 420 

ggcgatggcg ccggcctgaa ggttgtcgac actgccgtcg gtcgcgtggg tgccctggca 480 

tgctgggagc attacaatcc tctggcccgc tatactttga tggcccagca tgaggaaatt 540 

cacgcctctc atttcccggg ctcactggtc ggcccgatat tcggcgagca aatcgaagtc 600 

accatgcgcc accacgcgtt ggaatcgggc tgtttcgtgg tcaatgccac cggctggctg 660 

agcgaggagc agatcgcatc tattcatccg gaccccgcct tgcaaaaggg cctgcgcgat 720 

ggctgcatga cctgcatcat cacgccggaa ggacgccatg tcgtaccgcc gctgacctcg 780 

ggcgaaggca tcctgatcgg cgatctggac atgcggctca ttaccaagcg caagcggatg 840 
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atggattcgg tcggacacta tgctcggcct gaactgctgc accttgtcca tgacacgacg 900 
cccgcacgcg cacgcgagca ggtcggcctt tcaggcgatt ttcccgatgc ggagcaagac 960 
aagctatttg aggaggttca taatgcgtga 990 

<210> 112 
<211> 329 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 112 

Met Pro Lys Thr Val Arg Ala Ala Ala Val Gin He Ala Pro Asp Leu 

15 10 15 

Thr Ser Arg Ala Gly Thr Val Glu Arg Val Leu Asn Ala He Ala Glu 

20 25 30 

Ala Ala Asp Lys Gly Ala Glu Leu He Val Phe Pro Glu Thr Phe Val 

35 40 45 

Pro Trp Tyr Pro Tyr phe Ser Phe Val Leu Pro Pro Val Gin Gin Gly 

50 55 60 

Pro Glu His Leu Arg Leu Tyr Glu Glu Ala Val Thr Val Pro Ser Ala 
65 70 75 80 

Glu Thr Arg Ala Val Ala Asp Ala Ala Arg Lys Arg Asn Ala Val He 

85 90 95 

Val Leu Gly Val Asn Glu Arg Asp His Gly Ser Leu Tyr Asn Thr Gin 

100 105 HO 

Leu He Phe Asp Ala Asp Gly Ser Leu Lys Leu Lys Arg Arg Lys He 

115 120 125 

Thr Pro Thr Tyr His Glu Arg Met He Trp Gly Gin Gly Asp Gly Ala 

130 135 140 

Gly Leu Lys Val Val Asp Thr Ala Val Gly Arg Val Gly Ala Leu Ala 
145 150 155 160 

Cvs Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Thr Leu Met Ala Gin 

165 170 175 

His Glu Glu He His Ala Ser His Phe Pro Gly Ser Leu Val Gly Pro 

180 185 190 

He Phe Gly Glu Gin He Glu Val Thr Met Arg His His Ala Leu Glu 

195 200 205 

Ser Gly Cys Phe Val Val Asn Ala Thr Gly Trp Leu Ser Glu Glu Gin 

210 215 220 

He Ala Ser He His Pro Asp Pro Ala Leu Gin Lys Gly Leu Arg Asp 
225 230 235 240 

• Glv Cys Met Thr Cys He He Thr Pro Glu Gly Arg His Val Val Pro 
245 250 255 

Pro Leu Thr Ser Gly Glu Gly He Leu He Gly Asp Leu Asp Met Arg 

260 265 270 

Leu He Thr Lys Arg Lys Arg Met Met Asp Ser Val Gly His Tyr Ala 

275 280 285 

Arg Pro Glu Leu Leu His Leu Val His Asp Thr Thr Pro Ala Arg Ala 

290 295 300 

Arg Glu Gin Val Gly Leu Ser Gly Asp Phe Pro Asp Ala Glu Gin Asp 
305 310 315 320 

Lys Leu Phe Glu Glu Val His Asn Ala 
325 

<210> 113 
<211> 993 
<212> DNA 
<213> Unknown 
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<220> 

<223> Obtained from an environmental sample 
<400> 113 

atgacgaagg aacgcgccgc gcgcagcctg cgcgcagctg ccatacagct tgaagccgaa 60 

gtcggcgaca tcgccgccaa tctcgcacgc atcgaggcga tggtcgagga ggctgcgggc 120 

aagggcgccg aactgatcgc cattccggag ttctgcacct cccgcatgcc cttcgatgca 180 

cgcgtgcacg acgccgtgct gccgccggac aacttcgtgg tcgatgcctt tcgccgcatg 240 

gcagcgacgc acaactgccg gctcggcggc tccatgctca ttgccgacgg tggcgagatc 300 

tacaaccgct accacttcgt cgaacccgac ggcagcgtgc atctgcacga caaggatctg 360 

ccgacgatgt gggagaacgc cttctacacc ggcggctccg acgacggcgt cttcgacacc 420 

ggcatcggcg gcgtcggcgc cgcggtgtgc tgggaactgg tacgcaccgg caccgtgcga 480 

cgcatgctcg gtcgcgtcga cgtcgccatg accggcacgc attggtggac gatgccgcap 540 

aactggggca gcgccgtcgc gcgcacgctg gccgcgatga cgcagtacaa ccgctacatg 600 

tccgagaatg cacccaccga attcgcccgc cgcctgggtg tgccggtgct gcaggcctcg 660 

cactgcggaa gcttccgcac cggtttcttg ctgctgccag gcagcgggcg tgcactgccc 720 

tatgacaccg agtacgtcgg cgccacacag atcgtcgatg ccgatggcca catcctcgcc 780 

caccgtcgca cgcaggaagg ccccggtgtc gtcgtcgccg acatcacgct cggtgcccgc 840 

acgcccgagc tgccactgga agaccgcttc tggattcccg agctgccgct cttcctcaag 900 

gcctactggc accaccagaa cctgtgcggc aagtcctact accgtcgcgt cggccgcgat 960 

gccggcctgg cggcggcgga gcgttcggca tga 993 



<210> 114 
<211> 330 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 114 

Met Thr Lys Glu Arg Ala Ala Arg Ser Leu Arg Ala Ala Ala lie Gin 

15 10 15 

Leu Glu Ala Glu Val Gly Asp lie Ala Ala Asn Leu Ala Arg lie Glu 

20 25 30 

Ala Met Val Glu Glu Ala Ala Gly Lys Gly Ala Glu Leu lie Ala lie 

35 40 45 

Pro Glu Phe Cys Thr Ser Arg Met Pro Phe Asp Ala Arg Val His Asp 

50 55 60 

Ala Val Leu Pro Pro Asp Asn Phe Val Val Asp Ala Phe Arg Arg Met 
65 70 75 80 

Ala Ala Thr His Asn Cys Arg Leu Gly Gly Ser Met Leu lie Ala Asp 

85 90 95 

Gly Gly Glu He Tyr Asn Arg Tyr His Phe Val Glu Pro Asp Gly Ser 

100 105 HO 

Val His Leu His Asp Lys Asp Leu Pro Thr Met Trp Glu Asn Ala Phe 

115 120 125 

Tyr Thr Gly Gly Ser Asp Asp Gly Val Phe Asp Thr Gly He Gly Gly 

130 135 140 

Val Gly Ala Ala Val Cys Trp Glu Leu Val Arg Thr Gly Thr Val Arg 
145 150 155 160 

Arg Met Leu Gly Arg Val Asp Val Ala Met Thr Gly Thr His Trp Trp 

165 170 175 

Thr Met Pro His Asn Trp Gly Ser Ala Val Ala Arg Thr Leu Ala Ala 

180 185 190 

Met Thr Gin Tyr Asn Arg Tyr Met Ser Glu Asn Ala Pro Thr Glu Phe 

195 200 205 

Ala Arg Arg Leu Gly Val Pro Val Leu Gin Ala Ser His Cys Gly Ser 

210 215 220 

Phe Arg Thr Gly Phe Leu Leu Leu Pro Gly Ser Gly Arg Ala Leu Pro 
225 230 235 240 
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Tyr 


Asp 


Thr 




T\/y- Va 1 Civ Ala 


Thr 


Gin 


lie Val Asd Ala Asp 


Gly 






245 




250 


255 




His 


lie 


Leu 


Ala 


His Arg Arg Thr 


Gin 


Glu 


Gly Pro Gly Val Val 


Val 








260 




265 




270 




Ala 


Asp 


lie 


Thr 


Leu Gly Ala Arg 


Thr 


Pro 


Glu Leu Pro Leu Glu 


Asp 




275 




280 






285 




Arg 


Phe 


Trp 


lie 


Pro Glu Leu Pro 


Leu 


Phe 


Leu Lys Ala Tyr Trp 


His 


290 




295 






300 




His 


Gin 


Asn 


Leu 


Cys Gly Lys Ser 


Tyr 


Tyr 


Arg Arg Val Gly Arg 


Asp 


305 








310 






315 


320 


Ala 


Gly Leu 


Ala 


Ala Ala Glu Arg 


Ser 


Ala 







325 330 



<210> 115 
<211> 993 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 115 

atgaaccaaa tcattaaagc ggcggcagtt caatgtagcc ctgtgttgta tagccaagcg 60 

ggtacagtca agaaaatctg tgacacgatt ttggagttgg ggcagcaagg tgtgcaattt 120 

gccgtatttc ctgaaactgt tgtgccttat tacccttatt tttcttttgt gcaaccaccg 180 

tttgccatgg gtaaagaaca tttaaagcta ttgcatgaat cggttgtcgt gccatcggca 240 

gcaacaactt taattggaca ggcatgcaaa gaagcgaaca tggtggtttc tattggtatt 300 

aatgagcgtg caggcggcac gatttataac gctcaattgt tgtttgatgc ggatggttcg 360 

attattcagc atcgccgtaa aattacccca acgtatcatg aacgtatggt gtgggggcaa 420 

ggcgatggca gtggtttacg tgcgatagat tctgctgtag gacgtattgg gtcgctggca 480 

tgttgggagc attacaaccc tttggctcgg tttgctttga tggcggatgg tgagcaaatt 540 

catgcggcga tgtttccggg atcactcgtg gggcagattt ttgcagatca gatcagtgcc 600 

accattcagc accatgcttt agagtcgggc tgttttgtgg tgaatgccac agcatggctt 660 

gacccagagc aacaacaaca aattatgcaa gatacaggct gtgaactcgg tccaatttcg 720 

gggggatgtt ttacggccat cgtttctcca gaaggcaaat ttttgtctga accgatcaca 780 
caaggcgaag gttatgtgat tgccgattta gacttttcct taatcgaaaa acgtaaacgg 
atgatggatt ctgttgggca ttatagtcgt ccagaattac tcagtttgtt gattgatcgt 
cgtcctacct cagttttgca tgagttaaaa ctagagaatc catcgaataa cagcatcgaa 

aaagtgtctg aatttgccga ggtacacgca tag 993 

<210> 116 
<211> 330 
<212> PRT 
<213> Unknown 



840 
900 
960 



<220> 

<223> Obtained from an environmental sample 



<400> 116 












Met Asn Gin 


He 


He 


Lys 


Ala 


Ala 


1 




5 








Tyr Ser Gin 


Ala 
20 


Gly 


Thr 


Val 


Lys 


Leu Gly Gin 


Gin 


Gly 


Val 


Gin 


Phe 


35 










40 


Pro Tyr Tyr 


Pro 


Tyr 


Phe 


Ser 


Phe 


50 








55 




Lys Glu His 


Leu 


Lys 


Leu 


Leu 


His 


65 






70 






Ala "Thr Thr 


Leu 


lie 


Gly Gin 


Ala 






85 









Ala Val Gin Cys Ser Pro Val Leu 

10 15 
Lys He Cys Asp Thr He Leu Glu 
25 30 
Ala Val Phe Pro Glu Thr Val Val 
45 

Val Gin Pro Pro Phe Ala Met Gly 
60 

Glu Ser Val Val Val Pro Ser Ala 

75 80 
Cys Lys Glu Ala Asn Met Val Val 
90 95 
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Ser lie Gly lie Asn Glu Arg Ala Gly Gly Thr He Tyr Asn Ala Gin 

100 105 HO 

Leu Leu Phe Asp Ala Asp Gly Ser He He Gin His Arg Arg Lys He 

115 120 125 

Thr Pro Thr Tyr His Glu Arg Met Val Trp Gly Gin Gly Asp Gly Ser 

130 135 140 

Glv Leu Arg Ala He Asp Ser Ala Val Gly Arg He Gly Ser Leu Ala 
145 150 155 160 

Cvs Trp Glu His Tyr Asn Pro Leu Ala Arg Phe Ala Leu Met Ala Asp 

165 170 175 

Gly Glu Gin He His Ala Ala Met Phe Pro Gly Ser Leu Val Gly Gin 

180 185 190 

He Phe Ala Asp Gin He Ser Ala Thr He Gin His His Ala Leu Glu 

195 200 205 

Ser Gly Cys Phe Val Val Asn Ala Thr Ala Trp Leu Asp Pro Glu Gin 

210 215 220 

Gin Gin Gin He Met Gin Asp Thr Gly Cys Glu Leu Gly Pro He Ser 
225 230 235 240 

Glv Glv Cys Phe Thr Ala He Val Ser Pro Glu Gly Lys Phe Leu Ser 

* 245 250 255 

Glu Pro He Thr Gin Gly Glu Gly Tyr Val He Ala Asp Leu Asp Phe 

260 265 270 

Ser Leu He Glu Lys Arg Lys Arg Met Met Asp Ser Val Gly His Tyr 

275 280 285 

Ser Arg Pro Glu Leu Leu Ser Leu Leu He Asp Arg Arg Pro Thr Ser 

290 295 300 

Val Leu His Glu Leu Lys Leu Glu Asn Pro Ser Asn Asn Ser He Glu 
305 310 315 320 

Lys Val Ser Glu Phe Ala Glu Val His Ala 
325 330 

<210> 117 
<211> 957 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 

<400> 117 ^ ^ , n 

atgactcaat ccaggataat tcgtgctgcg gcagcgcaga tcgctccgga tttgcaggtt bo 
ccaggtaaca cgatcgacaa agtttgccgc accatcagcg aggcggccgc aaaaggcgta 120 
cagattattg ttttccctga aaccttggtg ccttattacc cttacttctc ttacatttca 180 
ccgcccattc aacagggcaa agaacatttg cggctgtatg accatgcagt ggttgtgccc 
ggctcggaaa ccgaggcaat ttcagctctt gccgcccaac acaatatggt ggtggttttg 
ggtgtgaacg agcgcgatca cggcacactt tacaacgcac aaattatttt caacagcgac 
ggaaagattc tgttgaagcg ccgaaaaatt acaccaactt atcacgagcg gatggtgtgg 420 
gggcagggtg acgcttcagg cttgaaggtg gttgattccg cagtgggccg tgtgggtgca 480 
ttggcctgtt gggaacacta caaccccttg gctcgctatt gtttgatggc ccagcacgaa 540 
gaaattcact gtgcgcagtt tcccggttca ttggtggggc aagtttttgc cgaccaaatg 
gaagtgacca ttcgtcacca cgcacttgag tcgggctgtt ttgtcatcaa cagcaccgct 
tggctttctg aagaacaggt tcaaagtatt tcatccgaca gcgcattgca gaaagggctt 
agaggcggtt gtttcacggc cattgtcagc cctgagggaa agctgttggc tgagccgctc 780 
accgagggtg agggcatggt gatcgccgac ctcgacatgg cgttggttac gaaacgcaaa 840 
cgcatgatgg a??cagtggg ccattatgcg cgccccgagt tgttgagttt gctggttcgg 900 
gatgaggctt caagccccat gaaaaaaatt cagggagttc aacatgctga gtactga 

<210> 118 
<211> 318 
<212> PRT 
<213> Unknown 



240 
300 
360 



600 
660 
720 



957 
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<220> 

<223> Obtained from an environmental sample 
<400> 118 

Met Thr Gin Ser Arg lie lie Arg Ala Ala Ala Ala Gin lie Ala Pro 

15 10 15 

Asp Leu Gin Val Pro Gly Asn Thr lie Asp Lys Val Cys Arg Thr lie 

20 25 30 

Ser Glu Ala Ala Ala Lys Gly Val Gin He He Val Phe Pro Glu Thr 

35 40 45 

Leu Val Pro Tyr Tyr Pro Tyr Phe Ser Tyr He Ser Pro Pro He Gin 

50 55 60 

Gin Gly Lys Glu His Leu Arg Leu Tyr Asp His Ala Val Val Val Pro 
65 70 75 80 

Gly Ser Glu Thr Glu Ala He Ser Ala Leu Ala Ala Gin His Asn Met 

85 90 95 

Val Val Val Leu Gly Val Asn Glu Arg Asp His Gly Thr Leu Tyr Asn 

100 105 HO 

Ala Gin He He Phe Asn Ser Asp Gly Lys He Leu Leu Lys Arg Arg 

115 120 125 

Lys He Thr Pro Thr Tyr His Glu Arg Met Val Trp Gly Gin Gly Asp 

130 135 140 

Ala Ser Gly Leu Lys Val Val Asp Ser Ala Val Gly Arg Val Gly Ala 
145 150 155 160 

Leu Ala Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Cys Leu Met 

165 170 175 

Ala Gin His Glu Glu He His Cys Ala Gin Phe Pro Gly Ser Leu Val 

180 185 190 

Gly Gin Val Phe Ala Asp Gin Met Glu Val Thr He Arg His His Ala 

195 200 205 

Leu Glu Ser Gly Cys Phe Val He Asn Ser Thr Ala Trp Leu Ser Glu 

210 215 220 

Glu Gin Val Gin Ser He Ser Ser Asp Ser Ala Leu Gin Lys Gly Leu 
225 230 235 240 

Arg Gly Gly Cys Phe Thr Ala He Val Ser Pro Glu Gly Lys Leu Leu 

245 250 255 

Ala Glu Pro Leu Thr Glu Gly Glu Gly Met Val He Ala Asp Leu Asp 

260 265 270 

Met Ala Leu Val Thr Lys Arg Lys Arg Met Met Asp Ser Val Gly His 

275 280 285 

Tyr Ala Arg Pro Glu Leu Leu Ser Leu Leu Val Arg Asp Glu Ala Ser 

290 295 300 

Ser Pro Met Lys Lys He' Gin Gly Val Gin His Ala Glu Tyr 
305 310 315 

<210> 119 
<211> 984 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 119 

atggatacac tcaaagttgg attggttcag atggccccca tctggttgaa ccgggataaa 60 
accctgatca aagttgagga atacatgcag aaagcaggca aacagggctg caacctggta 120 
gcttttggtg aagcgctggt tcccggctac cccttctggg tggaacgcac agagggcgcc 180 
agattcaatt ccaaagtcca gaaagaactc tttgcacatt accttgatca ggcggtgcag 240 
atcgaagccg gccaccttga tcctctccag gcattagccc aacaatacaa gatggctgtg 



300 



tacgtgggga cgattgaacg cccgcctgag cggagcggcc acagcctgta ctgctcccta 360 
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<210> 120 
<211> 327 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 120 

Met Asp Thr Leu Lys Val Gly Leu Val Gin Met Ala Pro lie Trp Leu 

15 10 15 

Asn Arg Asp Lys Thr Leu lie Lys Val Glu Glu Tyr Met Gin Lys Ala 

20 25 30 

Gly Lys Gin Gly Cys Asn Leu Val Ala Phe Gly Glu Ala Leu Val Pro 

35 40 45 

Glv Tvr Pro Phe Trp Val Glu Arg Thr Glu Gly Ala Arg Phe Asn Ser 

50 55 60 

Lys Val Gin Lys Glu Leu Phe Ala His Tyr Leu Asp Gin Ala Val Gin 
65 70 75 80 

He Glu Ala Gly His Leu Asp Pro Leu Gin Ala Leu Ala Gin Gin Tyr 

85 90 95 

Lys Met Ala Val Tyr Val Gly Thr He Glu Arg Pro Pro Glu Arg Ser 

100 105 HO 

Gly His Ser Leu Tyr Cys Ser Leu lie Phe He Asp Pro Glu Gly Glu 

115 120 125 

He Gly Ser Val His Arg Lys Leu Met Pro Thr His Glu Glu Arg Leu 

130 135 140 

Val Trp Ser Thr Gly Asp Gly His Gly Leu Arg Thr His Ser Leu Gly 
145 150 155 160 

Ala Phe Thr Val Gly Gly Leu Asn Cys Trp Glu Asn Trp Met Pro Leu 

165 170 175 

Ser Arg Thr Ala Leu Tyr Ala Met Gly Glu Asp Leu His Val Ala Ala 

180 185 190 

Trp Pro Gly Ser Gin Arg Asn Thr Tyr Asp He Thr Lys Phe He Ala 

195 200 205 

Lys Glu Ser Arg Ser Tyr Val He Ser Val Ser Gly Met Met Lys Lys 

210 215 220 

Glu Asn He Leu Ser Glu He Pro . His Ser Gin Leu Met Leu Glu Asn 
225 230 235 240 

Ser Glu Asp He Met Ala Asp Gly Gly Ser Cys Leu Ala Gly Pro Asp 

245 250 255 

Glv Glu Trp He He Glu Pro He Val Gly Glu Glu Thr Leu Val Thr 

260 265 270 

Ala Glu Leu Ser His Gin Arg Val Arg Glu Glu Arg Gin Asn Phe Asp 

275 280 285 

Pro Thr Gly His Tyr Ser Arg Pro Asp Val Thr Arg Leu Val Val Asp 

290 295 300 

Arg Arg Arg Gin Gin He Leu Glu He Thr Pro Asp Glu Lys Gly Arg 
305 310 315 320 



420 
480 
540 
600 
660 



atatttatag acccagaagg cgagatcggc tcggttcacc gcaagttgat gcccacccat 
gaggaacgcc tggtctggtc aactggcgat gggcacggcc tgcgaacaca ttctctgggc 
gcctttaccg ttggcggact caactgctgg gaaaactgga tgccgctctc ccgcacagct 
ctttatgcca tgggagagga tcttcatgtt gctgcctggc ccgggagtca gcgcaatact 
tatgatataa ccaaattcat tgccaaggaa tctcgctctt atgtgatctc cgtatccggg 
atgatgaaaa aagaaaatat cctctctgaa attccccaca gccaattgat gctggaaaat 720 
agcgaggata ttatggctga tggcggatcc tgtctggctg gaccagatgg agaatggatc 780 
atcgagccca tcgtcggaga ggaaaccctg gtaactgctg aactatcaca tcagcgggtc 840 
agagaagaaa gacagaattt cgacccaaca ggtcactaca gtcggcctga tgtgacccgc 900 
ctggtagtcg accgcaggcg ccagcagatc ctggagatca ccccggacga aaaaggaaga 960 
tcggatgaaa atcaatccct ttaa 



960 
984 
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Ser Asp Glu Asn Gin Ser Leu 
325 

<210> 121 
<211> 1158 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 121 

atgagcaaaa aagttctagg cggcagagaa aaagtaaaag ttgcagtagt tcaggctgcg 60 

cccgttttca tggacaagga gaagacgatt gaaaaggctt gcaagctaat aaaagaagcg 120 

gggagaaatc gagccgagct catagcgttc tcagagtcat tcatccccgt ctatcctgca 180 

tactataccg tcggctatga aaccccttct caagaatgga gagattacgt gattgcgcta 24 0 

caggataact ccgtgctgat tccgagcgag gataccgagg tactcggaca ggctgcaaag 300 

gaggcagggg cttatgcagt aataggatgc agcgagatgg acgaccgtcc gggaagccga 360 

acagtttaca acacgctcct cttcatcggc aaagacggca aggtcatggg aaggcataga 420 

aaactcaaac ccacgttcac ggagagaata tactggggag agggagatgc tggagacata 480 

aaggtttttg ataccgagat cggcaggatc ggaggcctcg tatgctggga gaaccatatg 540 

actctagtca gggccgcgat gatacacagg ggagaggagt ttcatatcgc ggtctggccg 600 

ggaaactgga agggtgcgga aaacaagctt ctccaagcag ataatagccc aggaggcgcc 660 

ctctgcaacc ttcaatctct cattaaagta cacgcctttg aggccggggc gtttgtgctg 720 

agcgcttgcg gctttttgac gccagaggat ttcccggaaa ggtggcatta tataagggat 780 

ggtaaccata ttaactgcga ctgggcactg ggcggaagct caatcgtcaa tcccgccggc 840 

cgttatctcg tcgagcctaa ctttgagaag gatgcaatcc tctatgcgga ttgttatgca 900 

aaccagataa aagcagtaaa agcggttttt gattcccttg gccactattc ccgctgggat 960 

attgcccaac tggcgataag gcaggaagcc tggaatccag aggtttcttt gatcgattcc 1020 

tcttcgactg aagttgagct tccggcagac gagcttcgaa ggatttcgga gaagtttgaa 1080 

gtaactgcgg ataagttgga atctttgctt gaggaaattg gaaagattaa aaagcccagg 1140 

aaacaagccg gttcctaa 1158 

<210> 122 
<211> 385 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 122 



Met 


Ser 


Lys 


Lys 


Val 


Leu 


Gly 


Gly 


Arg Glu 


Lys Val Lys Val Ala Val 


1 






5 








10 


15 


val 


Gin 


Ala 


Ala 


Pro 


Val 


Phe 


Met 


Asp Lys 


Glu Lys Thr He Glu Lys 








20 










25 


30 


Ala 


Cys 


Lys 


Leu 


lie 


Lys 


Glu 


Ala 


Gly Arg 


Asn Arg Ala Glu Leu He 






35 










40 




45 


Ala 


Phe 


Ser 


Glu 


Ser 


Phe 


He 


Pro 


Val Tyr 


Pro Ala Tyr Tyr Thr Val 




50 










55 






60 


Gly 


Tyr 


Glu 


Thr 


Pro 


Ser 


Gin 


Glu 


Trp Arg 


Asp Tyr Val He Ala Leu 


65 








70 








75 80 


Gin 


Asp 


Asn 


Ser 


Val 


Leu 


He 


Pro 


Ser Glu 


Asp Thr Glu Val Leu Gly 








85 








90 


95 


Gin 


Ala 


Ala 


Lys 


Glu 


Ala 


Gly 


Ala 


Tyr Ala 


Val He Gly Cys Ser Glu 








100 










105 


110 


Met 


Asp 


Asp 


Arg 


Pro 


Gly 


Ser 


Arg 


Thr Val 


Tyr Asn Thr Leu Leu Phe 






115 










120 




125 


lie 


Gly 


Lys 


Asp 


Gly 


Lys 


Val 


Met 


Gly Arg 


His Arg Lys Leu Lys Pro 




130 










135 






140 


Thr 


Phe 


Thr 


Glu 


Arg 


lie 


Tyr 


Trp 


Gly Glu 


Gly Asp Ala Gly Asp He 
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145 






150 


155 




160 


Lvs 


Val 


Phe Asp 


Thr Glu lie Gly Arg 


He Gly Gly 


Leu 


Val Cys Trp 








165 


170 




175 


Glu 


Asn 


His Met 


Thr Leu Val Arg Ala 


Ala Met He 


His Arg Gly Glu 






180 


185 






190 


Glu 


Phe 


His lie 


Ala Val Trp Pro Gly 


Asn Trp Lys 


Gly Ala Glu Asn 






195 


200 




205 




Lys 


Leu 


Leu Gin 


Ala Asp Asn Ser Pro 


Gly Gly Ala 


Leu 


Cys Asn Leu 




210 




215 


220 






Gin 


Ser 


Leu Xle 


Lys Val His Ala Phe 


Glu Ala Gly 


Ala 


Phe Val Leu 


225 






230 


235 




240 


Ser 


Ala 


Cys Gly 


Phe Leu Thr Pro Glu 


Asp Phe Pro 


Glu Arg Trp His 








245 


250 




255 


Tyr 


lie 


Arg Asp 


Gly Asn His lie Asn 


Cys Asp Trp 


Ala Leu Gly Gly 






260 


265 






270 


Ser 


Ser 


lie Val 


Asn Pro Ala Gly Arg 


Tyr Leu Val 


Glu 


Pro Asn Phe 






275 


280 




285 




Glu Lys 


Asp Ala 


lie Leu Tyr Ala Asp 


Cys Tyr Ala 


Asn 


Gin He Lys 




290 




295 


300 






Ala 


Val 


Lys Ala 


Val Phe Asp Ser Leu 


Gly His Tyr 


Ser 


Arg Trp Asp 


305 






310 


315 




320 


lie 


Ala 


Gin Leu 


Ala He Arg Gin Glu 


Ala Trp Asn 


Pro 


Glu Val Ser 








325 


330 




335 


Leu 


lie 


Asp Ser 


Ser Ser Thr Glu Val 


Glu Leu Pro 


Ala 


Asp Glu Leu 






340 


345 






350 


Arg 


Arg 


lie Ser 


Glu Lys Phe Glu Val 


Thr Ala Asp 


Lys 


Leu Glu Ser 






355 


360 




365 




Leu 


Leu 


Glu Glu 


He Gly Lys He Lys 


Lys Pro Arg 


Lys 


Gin Ala Gly 




370 




375 


380 







Ser 
385 



<210> 123 

<211> 990 • 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 123 

atgtcaactt tcaagatcgc taccgtgcag 

accattgaca agacttgcga .gctgatcgcc 

gtggtcttcc ccgaggcctt tatccccacc 

ggacagcacc agatgcttgc cgacctgtac 

cccagtctgg ctaccgagcg gctctgtcag 

gtgggcctta acgaacgcaa tacagaggcc 

tacattgacg ccgagggcaa cttgctaggt 

gaacgcatgg tctgggcaca gggagatggc 

ggaaaactca gcggactaat ctgttgggag 

tatgcctggg gagtacagct ctatttggct 

tccactctgc ggcacattgc caaggaagga 

ttacgtaagg aagacatccc cgaccgattc 

ggagagtgga taaacaaagg tgacagcgtc 

gggcctctac ataaggaaca ggggatactc 

gcccccaagt ggaacctgga tgtagccgga 

accgtgagca aggatggcca tccgatgctc 

caagataaga ccgaagtatt agagggctag 

<210> 124 
<211> 329 



agtgcaccag tatttatgga 
gaagcagcac aagatgacga 
tatccggact gggtatggcg 
ggggagttgc tcgagcagtc 
gctgcaaaga aagcgggcgt 
agcaacgcta ccctgtacaa 
aagcaccgaa agctggtacc 
agtacccttg aggtctacga 
aactacatgc ctctcgctcg 
cctacttggg atcgaggcga 
cgagtatacg tggtcggctg 
gaattcaagg cgaagtacta 
atcgtcggtc ccgatggcga 
tatgctgagc tggacacaag 
cactacgcgc gcccggacgt 
ggcgttgccc aagggcccaa 



ccgcgaagct 60 

cgttcgccta 120 

tatccctccc 180 

ggtgacgata 240 

ttatgtagct 300 

caccctgctc 360 

gaccgctccc 420 

gacctccttc 480 

ttatgccctg 540 

gccctggctt 600 

ctctatcgcc 660 

cgcagaggca 720 

gctcatcgcc 780 

gcagatgcac 840 

gtttcggctg 900 

gcatgagccg 960 
990 
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<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 124 

Met Ser Thr Phe Lys lie Ala Thr Val Gin Ser Ala Pro Val Phe Met 

15 10 15 

Asp Arg Glu Ala Thr lie Asp Lys Thr Cys Glu Leu He Ala Glu Ala 

20 25 30 

Ala Gin Asp Asp Asp Val Arg Leu Val Val Phe Pro Glu Ala Phe He 

35 40 45 

Pro Thr Tyr Pro Asp Trp Val Trp Arg He Pro Pro Gly Gin His Gin 

50 55 60 

Met Leu Ala Asp Leu Tyr Gly Glu Leu Leu Glu Gin Ser Val Thr He 
65 70 75 80 

Pro Ser Leu Ala Thr Glu Arg Leu Cys Gin Ala Ala Lys Lys Ala Gly 

85 ■ 90 95 

Val Tyr Val Ala Val Gly Leu Asn Glu Arg Asn Thr Glu Ala. Ser Asn 

100 105 HO 

Ala Thr Leu Tyr Asn Thr Leu Leu Tyr He Asp Ala Glu Gly Asn Leu 

115 120 125 

Leu Gly Lys His Arg Lys Leu Val Pro Thr Ala Pro Glu Arg Met Val 

130 135 140 

Trp Ala Gin Gly Asp Gly Ser Thr Leu Glu Val Tyr Glu Thr Ser Phe 
145 " 150 155 160 

Gly Lys Leu Ser Gly Leu He Cys Trp Glu Asn Tyr Met Pro Leu Ala 

165 170 175 

Arg Tyr Ala Leu Tyr Ala Trp Gly Val Gin Leu Tyr Leu Ala Pro Thr 

180 185 190 

Trp Asp Arg Gly Glu Pro Trp Leu Ser Thr Leu Arg His He Ala Lys 

195 200 205 

Glu Gly Arg Val Tyr Val Val Gly Cys Ser He Ala Leu Arg Lys Glu 

210 215 220 

Asp He Pro Asp Arg Phe Glu Phe Lys Ala Lys Tyr Tyr Ala Glu Ala 
225 230 235 240 

Gly Glu Trp He Asn Lys Gly Asp Ser Val He Val Gly Pro Asp Gly 

245 250 255 

Glu Leu He Ala Gly Pro Leu His Lys Glu Gin Gly He Leu Tyr Ala 

260 265 270 

Glu Leu Asp Thr Arg Gin Met His Ala Pro Lys Trp Asn Leu Asp Val 

275 280 285 

Ala Gly His Tyr Ala Arg Pro Asp Val Phe Arg Leu Thr Val Ser Lys 

290 295 300 

Asp Gly His Pro Met Leu Gly Val Ala Gin Gly Pro Lys His Glu Pro 
305 310 315 320 

Gin Asp Lys Thr Glu Val Leu Glu Gly 
325 

<210> 125 
<211> 1050 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 125 ' 

atgacaactg taaaaaagac ggtacgcgca gcagcgatcc agatcgcacc tgacctcgac 
agtgcaggcg gtacgctgga caaggtttgc acggccattc aaaaggcggc ggcacaaggc 



60 
120 
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gcggagctgg tggtttttcc cgaaaccttc ttgccctact atccttactt ttcattcgtg 180 

cggccgccct tcgcatccgg cccggaacac ttgctgctat atgaacgcgc agtggcggtg 240 

ccaggcccgg tgaccgatgc cgtctctgcc gtcgcgcgca gccacggcgt ggtggtggta 300 

ctcggcgtca atgaacgcga ccatggcacg ctgtacaaca cccaactggt gttcgacgcg 360 

aatggcgaac tggtgttgaa acgcagaaaa atcacgccga cttatcacga gcggatgatc 420 

tggggtcaag gcgacggcag cggactcaaa gtagtgcaaa cggcggtcgg ccggctaggc 4 80 

gcgctagcct gttgggaaca ctacaaccca ctggcccgtt atgcattgat ggcgcaacac 540 

gaagaaatcc attgcgccca gtttcccggg tccatggtcg ggcaaatatt cgccgaccag 600 

atggaagtga cgatacgcca tcacgctctc gagtcggctt gcttcgtggt gaatgccaca 660 

ggctggctga ccgatgcgca aatcacatcg atcacgccgg accccgcgct acaaaaggca 720 

ttacgtggcg gttgctgcac cgccatcgtc tcgccggaag gtgtgctcct ggcagagccg 780 

ctacgcagcg gcgaaggcat ggtgatcgcc gatctcgata tggcactcat caccaaacgc 840 

aaacggatga tggattcggt cggccactat gcgcggcccg aattgttaag cctgcttgtc 900 

gacgaccggc gcaaggtacc ggtatccgcg ctatttgccg acagcaaccc tgccaacggg 960 

cacacagttt tcaccccatc cgacatacca acccttggga gcgcacatca tgcaaacagt 1020 

taccaaaccg aaccagcaac tgatcactga 1050 

<210> 126 
<2U> 349 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 126 



Met 


Thr 


Thr 


Val 


Lys 


Lys 


Thr 


Val 


Arg 


Ala 


Ala 


Ala 


He 


Gin 


He 


Ala 


1 








5 










10 










15 




Pro 


Asp 


Leu 


Asp 


Ser 


Ala 


Gly 


Gly 


Thr 


Leu 


Asp 


Lys 


Val 


Cys 


Thr 


Ala 








20 










25 










30 






He 


Gin 


Lys 


Ala 


Ala 


Ala 


Gin 


Gly 


Ala 


Glu 


Leu 


Val 


Val 


Phe 


Pro 


Glu 






35 










40 










45 








Thr 


Phe 


Leu 


Pro 


Tyr 


Tyr 


Pro 


Tyr 


Phe 


Ser 


Phe 


Val 


Arg 


Pro 


Pro 


Phe 




50 










55 










60 










Ala 


Ser 


Gly 


Pro 


Glu 


His 


Leu 


Leu 


Leu 


Tyr 


Glu 


Arg 


Ala 


Val 


Ala 


Val 


65 










70 










75 










80 


Pro 


Gly 


Pro 


Val 


Thr 


Asp 


Ala 


Val 


Ser 


Ala 


Val 


Ala 


Arg 


Ser 


His 


Gly 










85 










90 










95 




Val 


Val 


Val 


Val 


Leu 


Gly 


Val 


Asn 


Glu 


Arg 


Asp 


His 


Gly 


Thr 


Leu 


Tyr 








100 










105 










110 






Asn 


Thr 


Gin 


Leu 


Val 


Phe 


Asp 


Ala 


Asn 


Gly 


Glu 


Leu 


Val 


Leu 


Lys 


Arg 






115 










120 










125 








Arg 


Lys 


He 


Thr 


Pro 


Thr 


Tyr 


His 


Glu 


Arg 


Met 


He 


Trp 


Gly 


Gin 


Gly 




130 










135 










140 










Asp 


Gly 


Ser 


Gly 


Leu 


Lys 


Val 


Val 


Gin 


Thr 


Ala 


Val 


Gly 


Arg 


Leu 


Gly 


145 










150 










155 










160 


Ala 


Leu 


Ala 


Cys 


Trp 


Glu 


His 


Tyr 


Asn 


Pro 


Leu 


Ala 


Arg 


Tyr 


Ala 


Leu 










165 










170 










175 




Met 


Ala 


Gin 


His 


Glu 


Glu 


He 


His 


Cys 


Ala 


Gin 


Phe 


Pro 


Gly 


Ser 


Met 








180 










185 










190 






Val 


Gly 


Gin 


He 


Phe 


Ala 


Asp 


Gin 


Met 


Glu 


Val 


Thr 


He 


Arg 


His 


His 






195 










200 










205 








Ala 


Leu 


Glu 


Ser 


Ala 


Cys 


Phe 


Val 


Val 


Asn 


Ala 


Thr 


Gly 


Trp 


Leu 


Thr 




210 










215 










220 










Asp 


Ala 


Gin 


He 


Thr 


Ser 


He 


Thr 


Pro 


Asp 


Pro 


Ala 


Leu 


Gin 


Lys 


Ala 


225 










230 










235 










240 


Leu 


Arg 


Gly 


Gly 


Cys 


Cys 


Thr 


Ala 


He 


Val 


Ser 


Pro 


Glu 


Gly 


Val 


Leu 










245 










250 










255 




Leu 


Ala 


Glu 


Pro 


Leu 


Arg 


Ser 


Gly 


Glu 


Gly 


Met 


Val 


He 


Ala 


Asp 


Leu 








260 










265 










270 






Asp 


Met 


Ala 


Leu 


He 


Thr 


Lys 


Arg 


Lys 


Arg 


Met 


Met 


Asp 


Ser 


Val 


Gly 
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275 280 .285 

His Tyr Ala Arg Pro Glu Leu Leu Ser Leu Leu Val Asp Asp Arg Arg 

290 295 300 

Lys Val Pro Val Ser Ala Leu Phe Ala Asp Ser Asn Pro Ala Asn Gly 
305 310 315 320 

His Thr Val Phe Thr Pro Ser Asp He Pro Thr Leu Gly Ser Ala His 

325 330 335 

His Ala Asn Ser Tyr Gin Thr Glu Pro Ala Thr Asp His 
340 345 

<210> 127 
<211> 1005 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 127 

atgatagcac ggaagacaat aagggcggcg gcggtgcaga tagcgcctgt gatggaagat 
cggaaggcga cgaccgacaa ggtgtgcgcc tacattcagg aagcaggcga gaatggagcc 120 
gaaattgtgg tgtttcctga aaccttcatt cccaattatc cctatttctc ttttgtaaaa 180 
cctcccgtgt tggcaggtaa ggatcacctt accttgtatg accaagcggt ggaaatccct 240 
agccctacta ccgaccaagt ggggtctatg gccaaaaaat ggggaatcgt agtggtgttg 300 
ggcgtgaacg aaagaagcca cggcactttg tacaatgccc aaattgtctt tgacgctact 360 
ggtgatattg tattggtgag acgcaaaatc acccctacct atcatgaacg gatgatctgg 420 
ggacagggag atggcagtgg attaaaagca gtagacacag ctgtgggaag agtgggcgct 480 
ttggcgtgtt gggaacacta taatccactt gcgcgctacg cccttatggt agaccatgag 540 
gaaattcatt gcagccaatt ccctggctct atggtcggcc ccattttcgg tgaccagata 600 
gaagtgacga ttcgccacca tgcgttggaa tcgggttgtt ttgtcatcaa ttccacaggt 660 
tggctgtttg aagagcaaat ccaagccatc accgatgatc cgaaactgca caaagcattg 720 
aaagacggct gtatgaccgc cattatttct cccgaaggcg tgcatttgac caaaccctta " ,on 
acagaaggcg aaggcatcat ctacgcctat ctggacatga aactcataga caagcggaaa 
cggatgatgg actcggtagg acactatgca cgtccagagt tgctctcttt gcatatcaac 
aatgcagagc aaaaaccagc cgtttacacc tctcctctta ccaaaacgga aaccaaagaa 
gacgtaaaaa gctatgatcg caacaaagaa cagcttatcg tctga 

<210> 128 
<211> 334 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 128 

Met He Ala Arg Lys Thr He Arg Ala Ala Ala Val Gin He Ala Pro 

1 5 10 15 

Val Met Glu Asp Arg Lys Ala Thr Thr Asp Lys Val Cys Ala Tyr He 

20 25 30 

Gin Glu Ala Gly Glu Asn Gly Ala Glu lie Val Val Phe Pro Glu Thr 

35 40 45 

Phe He Pro Asn Tyr Pro Tyr Phe Ser Phe Val Lys Pro Pro Val Leu 

50 55 60 

Ala Gly Lys Asp His Leu Thr Leu Tyr Asp Gin Ala Val Glu He Pro 
65 70 75 80 

Ser Pro Thr Thr Asp Gin Val Gly Ser Met Ala Lys Lys Trp Gly He 

85 90 95 

Val Val Val Leu Gly Val Asn Glu Arg Ser His Gly Thr Leu Tyr Asn 

100 105 HO 

Ala Gin He Val Phe Asp Ala Thr Gly Asp He Val Leu Val Arg Arg 



60 



780 
840 
900 
960 
1005 
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115 120 125 



Lys 


lie 


Thr 


Pro Thr 


Tyr 


His 


Glu 


Arg 


Met 


He Trp 


Gly 


Gin Gly 


Asp 




130 








135 








140 










Gly 


Ser 


Gly 


Leu Lys 


Ala 


Val 


Asp 


Thr 


Ala 


Val Gly 


Arg 


Val 


Gly 


Ala 


145 








150 










155 








160 


Leu 


Ala 


Cys 


Trp Glu 


His 


Tyr 


Asn 


Pro 


Leu 


Ala Arg 


Tyr 


Ala 


Leu 


Met 








165 










170 








175 




Val 


Asp 


His 


Glu Glu 


He 


His 


Cys 


Ser 


Gin 


Phe Pro 


Gly 


Ser 


Met 


Val 








180 








185 








190 






Gly 


Pro 


He 


Phe Gly 


Asp 


Gin 


He 


Glu 


Val 


Thr He 


Arg 


His 


His 


Ala 






195 








200 








205 








Leu 


Glu 


Ser 


Gly Cys 


Phe 


Val 


He 


Asn 


Ser 


Thr Gly 


Trp 


Leu 


Phe 


Glu 




210 








215 








220 










Glu 


Gin 


He 


Gin Ala 


He 


Thr 


Asp 


Asp 


Pro 


Lys Leu 


His 


Lys 


Ala 


Leu 


225 








230 










235 








240 


Lys Asp 


Gly 


Cys Met 


Thr 


Ala 


He 


He 


Ser 


Pro Glu 


Gly 


Val 


His 


Leu 








245 










250 








255 




Thr 


Lys 


Pro 


Leu Thr 


Glu Gly 


Glu 


Gly 


He 


He Tyr 


Ala 


Tyr 


Leu 


Asp 








260 








265 








270 






Met 


Lys 


Leu 


He Asp 


Lys 


Arg 


Lys 


Arg 


Met 


Met Asp 


Ser 


Val 


Gly 


His 






275 








280 








285 








Tyr 


Ala 


Arg 


Pro Glu 


Leu 


Leu 


Ser 


Leu 


His 


He Asn 


Asn 


Ala 


Glu 


Gin 




290 








295 








300 










Lys 


Pro 


Ala 


Val Tyr 


Thr 


Ser 


Pro 


Leu 


Thr 


Lys Thr 


Glu 


Thr 


Lys 


Glu 


305 








310 










315 








320 


Asp Val 


Lys 


Ser Tyr 


Asp 


Arg 


Asn 


Lys 


Glu 


Gin Leu 


He 


Val 







325 330 



<210> 129 
<211> 1011 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 129 

atgtcagaaa agcgaattat taaagcggct gcagttcaaa tcacaccaga ttttgaatcg 60 

catgatggaa ccgtaaagaa ggtttgtaat gtaattgatg aagcgggtgc taaaggtgta 120 

cagatcattg tattccctga aacctttatt ccatattacc catatttttc tttcatcact 180 

ccaccagtga ctgctggcgc ggagcatttg cggctctatg aaaaaagtgt cgtgatacct 240 

ggtcccgtta ctcaagccat ttccgaacgt gcacgcatga ataatatggt tgttgtactt 300 

ggtgtaaatg agcgtgataa cggcagtcta tataacaccc agattatttt tgatgctacc 360 

ggtgagatgc ttctgaagag aagaaaaatc acacctacct atcatgagcg catgatttgg 420 

gggcaaggag atgcttcagg cctgaaggtc gtcgatacgg ctattgggcg agtcggagca 480 

ttggcatgct gggagcacta taaccctttg gctagataca gcctcatgac acagcatgaa 540 

gaaattcact gtgctcaatt tccaggctcc atggttggtc agatcttcgc agatcaaatg 600 

gatgtcacga ttcgtcatca tgccttggag tcaggttgct tcgtcatcaa ctccactggc 660 

tggttaactg atgatcagat caaatctatc accgacgatc ccaaaatgca gaaagcttta 720 

agaggtggtt gcaacacggc cattatttct ccagaaggga atcatttaac cgagcctttg 780 

cgagaaggtg aaggcatggt gattgctgat cttgatatgg cactcatcac caaacgaaaa 840 

agaatgatgg actcagttgg ccactacgcc agaccagaac tgttgagctt agcgatcaat 900 

gatgctccgg ctactccttc attccagatg aacgaacatc gtcttaaatc agtgcaatta 960 

cctatcgcag aggagcttaa aaatgacaac aagcttagca gtggacagta a 1011 

<210> 130 

<211> 336 

<212> PRT 

<213> Unknown - 

<220> 
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<223> Obtained from an environmental sample 
<400> 130 

Met Ser Glu Lys Arg He He Lys Ala Ala Ala Val Gin He Thr Pro 

15 10 15 

Asp Phe Glu Ser His Asp Gly Thr Val Lys Lys Val Cys Asn Val He 

20 25 30 

Asp Glu Ala Gly Ala Lys Gly Val Gin He He Val Phe Pro Glu Thr 

35 40 45 

Phe He Pro Tyr Tyr Pro Tyr Phe Ser Phe He Thr Pro Pro Val Thr 

50 55 60 

Ala Gly Ala Glu His Leu Arg Leu Tyr Glu Lys Ser Val Val He Pro 
65 70 75 80 

Gly Pro Val Thr Gin Ala He Ser Glu Arg Ala Arg Met Asn Asn Met 

85 90 95 

Val Val Val Leu Gly Val Asn Glu Arg Asp Asn Gly Ser Leu Tyr Asn 

100 105 110 

Thr Gin He He Phe Asp Ala Thr Gly Glu Met Leu Leu Lys Arg Arg 

115 120 125 

Lys He Thr Pro Thr Tyr His Glu Arg Met He Trp Gly Gin Gly Asp 

130 135 140 

Ala Ser Gly Leu Lys Val Val Asp Thr Ala He Gly Arg Val Gly Ala 
145 150 155 160 

Leu Ala Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ser Leu Met 

165 170 175 

Thr Gin His Glu Glu He His Cys Ala Gin Phe Pro Gly Ser Met Val 

180 185 190 

Gly Gin He Phe Ala Asp Gin Met Asp Val Thr He Arg His His Ala 

195 200 205 

Leu Glu Ser Gly Cys Phe Val He Asn Ser Thr Gly Trp Leu Thr Asp 

210 215 220 

Asp Gin He Lys Ser He Thr Asp Asp Pro Lys Met Gin Lys Ala Leu 
225 230 235 240 

Arg Gly Gly Cys Asn Thr Ala He He Ser Pro Glu Gly Asn His Leu 

245 250 255 

Thr Glu Pro Leu Arg Glu Gly Glu Gly Met Val He Ala Asp Leu Asp 

260 265 270 

Met Ala Leu He Thr Lys Arg Lys Arg Met Met Asp Ser Val Gly His 

275 280 285 

Tyr Ala Arg Pro Glu Leu Leu Ser Leu Ala lie Asn Asp Ala Pro Ala 

290 295 300 

Thr Pro Ser Phe Gin Met Asn Glu His Arg Leu Lys Ser Val Gin Leu 
305 310 315 320 

Pro He Ala Glu Glu Leu Lys Asn Asp Asn Lys Leu Ser Ser Gly Gin 
325 330 335 

<210> 131 
<211> 1011 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 131 

atgtcagaaa agcgaattat taaagcggct gcagttcaaa tcacaccaga ttttgaatcg 60 

catgatggaa ccgtaaagaa ggtttgtaat gtaattgatg aagcgggtgc taaaggtgta 120 

cagatcattg tattccctga aacctttatt ccatattacc catatttttc tttcatcact 180 

ccaccagtga ctgctggcgc ggagcatttg cggctctatg aaaaaagtgt cgtgatacct 240 

ggtcccgtta ctcaagacat ttccgaacgt gcacgcatga ataatatggt tgttgtactt 300 

ggtgtaaatg agcgtgataa cggcagtcta tataacaccc agattatttt tgatgctacc 360 
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ggtgagatgc ttctgaagag aagaaaaatc acacctacct atcatgagcg catgatttgg 420 

gggcaaggag atgcttcagg cctgaaggtc gtcgatacgg ctattgggcg agtcggagca 480 

ttggcatgct gggagcacta taaccctttg gctagataca gcctcatgac acagcatgaa 540 

gaaattcact gtgctcaatt tccaggctcc atggttggtc agatcttcgc agatcaaatg 600 

gatgtcacga ttcgtcatca tgccttggag tcaggttgct tcgtcatcaa ctccactggc 660 

tggttaactg atgatcagat caaatctatc accgacgatc ccaaaatgca gaaagcttta 720 

agaggtggtt gcaacacggc cattatttct ccagaaggga atcatttaac cgagcctttg 790 

cgagaaggtg aaggcatggt gattgctgat cttgatatgg cactcatcac caaacgaaaa 840 

agaatgatgg actcagttgg ccactacgcc agaccagaac tgttgagctt agcgatcaat 900 

gatgctccgg ctactccttc attccagatg aacgaacatc gtcttaaatc agtgcaatta 960 

cctatcgcag aggagcttaa aaatgacaac aagcttagca gtggacagta a 1011 

<210> 132 
<211> 336 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 132 



Met 


Ser 


Glu 


Lys 


Arg 


He 


lie 


Lys 


Ala Ala 


Ala Val 


Gin lie Thr 


Pro 


1 








5 








10 




15 




Asp 


Phe 


Glu 


Ser 


His 


Asp 


Gly 


Thr 


Val Lys 


Lys Val 


Cys Asn Val 


He 








20 










25 




30 




Asp 


Glu 


Ala 


Gly 


Ala 


Lys 


Gly 


Val 


Gin He 


lie Val 


Phe Pro Glu 


Thr 






35 










40 






45 




Phe 


He 


Pro 


Tyr 


Tyr 


Pro 


Tyr 


Phe 


Ser Phe 


He Thr 


Pro Pro Val 


Thr 




50 










55 






60 






Ala 


Gly 


Ala 


Glu 


His 


Leu 


Arg 


Leu 


Tyr Glu 


Lys Ser 


Val Val He 


Pro 


65 










70 








75 




80 


Gly 


Pro 


Val 


Thr 


Gin 


Asp 


He 


Ser 


Glu Arg 


Ala Arg 


Met Asn Asn 


Met 










85 








90 




95 




Val 


Val 


Val 


Leu 


Gly 


Val 


Asn 


Glu 


Arg Asp 


Asn Gly 


Ser Leu Tyr 


Asn 








100 










105 




110 




Thr 


Gin 


He 


He 


Phe 


Asp 


Ala 


Thr 


Gly Glu 


Met Leu 


Leu Lys Arg Arg 






115 










120 






125 




Lys 


He 


Thr 


Pro 


Thr 


Tyr 


His 


Glu 


Arg Met 


lie Trp 


Gly Gin Gly Asp 




130 










135 






140 






Ala 


Ser 


Gly 


Leu 


Lys 


Val 


Val 


Asp 


Thr Ala 


lie Gly 


Arg Val Gly Ala 


145 










150 








155 




160 


Leu 


Ala 


Cys 


Trp 


Glu 


His 


Tyr 


Asn 


Pro Leu 


Ala Arg 


Tyr Ser Leu 


Met 










165 








170 




175 




Thr 


Gin 


His 


Glu 


Glu 


lie 


His 


Cys 


Ala Gin 


Phe Pro 


Gly Ser Met 


Val 








180 










185 




190 




Gly 


Gin 


He 


Phe 


Ala 


Asp 


Gin 


Met 


Asp Val 


Thr He 


Arg His His 


Ala 




195 










200 






205 




Leu 


Glu 


Ser 


Gly 


Cys 


Phe 


Val 


He 


Asn Ser 


Thr Gly 


Trp Leu Thr 


Asp 




210 










215 






220 






Asp 


Gin 


lie 


Lys 


Ser 


He 


Thr 


Asp 


Asp Pro 


Lys Met 


Gin Lys Ala Leu 


225 










230 








235 




240 


Arg 


Gly 


Gly 


Cys 


Asn 


Thr 


Ala 


lie 


lie Ser 


Pro Glu 


Gly Asn His 


Leu 








245 








250 




255 




Thr 


Glu 


Pro 


Leu 


Arg 


Glu 


Gly 


Glu 


Gly Met 


Val lie 


Ala Asp Leu 


Asp 








260 








265 




270 




Met 


Ala 


Leu 


He 


Thr 


Lys 


Arg 


Lys 


Arg Met 


Met Asp 


Ser Val Gly His 






275 










280 






285 




Tyr 


Ala 


Arg 


Pro 


Glu 


Leu 


Leu 


Ser 


Leu Ala 


lie Asn 


Asp Ala Pro 


Ala 


290 










295 






300 






Thr 


Pro 


Ser 


Phe 


Gin 


Met 


Asn 


Glu 


His Arg 


Leu Lys 


Ser Val Gin 


Leu 


305 










310 








315 




320 
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Pro lie Ala Glu Glu Leu Lys Asn Asp Asn Lys Leu Ser Ser Gly Gin 
325 330 335 



<210> 133 
<211> 1026 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 133 

atgtcgacca agcggatcgt acgcgccgct gccgttcagc tggcaccgga tctggagcgg 60 

ccggagggca cactggagaa ggtttgcgcg gccatcgaca aggcggcggg ggacggtgtg 120 

cagctcatcg tcttccccga gaccttcgta ccgtactacc cgtacttctc tttcgtgcgt 180 

gcgccggtcg cgatgggtgc cgagcacatg cggttatacg agcgcgcggt agcggtgccc 240 

ggtccagtaa cggccaccgt ggcggagcgg gcaaaagcgc acgcgatggt cgtcgtgctg 300 

ggtgtaaacg agcgcgatca cggctcactg tataacgcgc aactgatctt cgacgagacc 360 

ggccgtctcg tcctcaaacg ccgcaagatc actccgacct atcacgagcg catggtgtgg 420 

gggcagggcg acggcagcgg ccttaaggtt gtagacaccg gtatcggcag gatcggagcc 480 

ctcgcctgct gggagcacta caacccgctc gcgcgctatg cgctcatggc gcagcacgaa 540 

gagattcatt gcgcgcagtt tccgggctcg atggtggggc cgatcttcgc ggatcagatc 600 

gaggtcacga tccgccatca cgcgctggag tcgggctgct tcgtcgtcaa tgcgaccggc 660 

tggctgacac ccgaacagat cgcgtcgatc acaccggacg cgggtctgca aaaggcaatc 720 

agcgggggct gcaacaccgc gatcatctcg ccggagggcg tgcacctggc cccgccgttg 780 

cgagaaggtg agggcatggt cgtggccgac ctcgacatgg cgctcatcac caaacgcaaa 840 

cgcatgatgg attcggtggg tcactacgct cgcccggagt tgctcagcct gcgcatcgat 900 

agccgcgccg cttcgccgat gtcgtcacaa atggaaatac ccgggagctt gcatgaaatc 960 

accagccacg atgtccagcc agcaactgat gaccgagctc cagtcctccg gcttgaggtt 1020 

ggctga 1026 

<210> 134 
<211> 341 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 134 



Met 


Ser 


Thr 


Lys 


Arg 


He 


Val 


Arg 


Ala 


Ala 


Ala 


Val 


Gin 


Leu 


Ala 


Pro 


1 








5 










10 










15 




Asp 


Leu 


Glu 


Arg 
20 


Pro 


Glu 


Gly 


Thr 


Leu 
25 


Glu 


Lys 


Val 


Cys 


Ala 
30 


Ala 


He 


Asp 


Lys 


Ala 
35 


Ala 


Gly 


Asp 


Gly 


Val 
40 


Gin 


Leu 


He 


Val 


Phe 
45 


Pro 


Glu 


Thr 


Phe 


Val 
50 


Pro 


Tyr 


Tyr 


Pro 


Tyr 
55 


Phe 


Ser 


Phe 


Val 


Arg 
60 


Ala 


Pro 


Val 


Ala 


Met 


Gly 


Ala 


Glu 


His 


Met 


Arg 


Leu 


Tyr 


Glu 


Arg 


Ala 


Val 


Ala 


Val 


Pro 


65 










70 










75 










80 


Gly 


Pro 


Val 


Thr 


Ala 
85 


Thr 


Val 


Ala 


Glu 


Arg 
90 


Ala 


Lys 


Ala 


His 


Ala 
95 


Met 


Val 


Val 


Val 


Leu 
100 


Gly 


Val 


Asn 


Glu 


Arg 
105 


Asp 


His 


Gly 


Ser 


Leu 
110 


Tyr 


Asn 


Ala 


Gin 


Leu 
115 


lie 


Phe 


Asp 


Glu 


Thr 
120 


Gly 


Arg 


Leu 


Val 


Leu 
125 


Lys 


Arg 


Arg 


Lys 


lie 
130 


Thr 


Pro 


Thr 


Tyr 


His 
135 


Glu 


Arg 


Met 


Val 


Trp 
140 


Gly 


Gin 


Gly 


Asp 


Gly 


Ser 


Gly 


Leu 


Lys 


Val 


Val 


Asp 


Thr 


Gly 


He 


Gly 


Arg 


He 


Gly 


Ala 


145 










150 










155 










160 


Leu 


Ala 


Cys 


Trp 


Glu 


His 


Tyr 


Asn 


Pro 


Leu 


Ala 


Arg 


Tyr 


Ala 


Leu 


Met 
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165 










170 








175 




Ala 


Gin 


His 


Glu 
180 


Glu 


lie 


His 


Cys 


Ala 
185 


Gin 


Phe 


Pro 


Gly 


Ser Met 
190 


Val 


Gly 


Pro 


lie 
195 


Phe 


Ala 


Asp 


Gin 


He 
200 


Glu 


Val 


Thr 


He 


Arg 
205 


His His 


Ala 


Leu 


Glu 
210 


Ser 


Gly 


Cys 


Phe 


Val 
215 


Val 


Asn 


Ala 


Thr 


Gly 
220 


Trp 


Leu Thr 


Pro 


Glu 


Gin 


lie 


Ala 


Ser 


He 


Thr 


Pro 


Asp 


Ala 


Gly 


Leu 


Gin 


Lys Ala 


He 


225 










230 










235 








240 


Ser Gly 


Gly 


Cys 


Asn 


Thr 


Ala 


He 


He 


Ser 


Pro 


Glu 


Gly 


Val His 


Leu 










245 










250 








255 




Ala 


Pro 


Pro 


Leu 
260 


Arg 


Glu 


Gly 


Glu 


Gly 
265 


Met 


Val 


Val 


Ala 


Asp Leu 
270 


Asp 


Met 


Ala 


Leu 
275 


lie 


Thr 


Lys 


Arg 


Lys 
280 


Arg 


Met 


Met 


Asp 


Ser 
285 


Val Gly 


His 


Tyr Ala 


Arg 


Pro 


Glu 


Leu 


Leu 


Ser 


Leu 


Arg 


He 


Asp 


Ser 


Arg Ala 


Ala 




290 










295 










300 








Ser 


Pro 


Met 


Ser 


Ser 


Gin 


Met 


Glu 


He 


Pro 


Gly 


Ser 


Leu 


His Glu 


He 


305 










310 










315 








320 


Thr 


Ser 


His 


Asp 


Val 
325 


Gin 


Pro 


Ala 


Thr 


Asp 
330 


Asp 


Arg 


Ala 


Pro Val 
335 


Leu 


Arg 


Leu 


Glu 


Val 
340 


Gly 























<210> 135 
<211> 1011 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 135 

atgtcagaca agcgaatcat taaagcggct gcagttcaaa tcactcctga ctttgactca 60 

gcagatggaa ccgttaagaa agtgtgcaag gtaatcgatg aagcaggtgc aaagggagtt 120 

caaattattg tattcccgga aaccttcatc ccttactacc catacttttc attcattaca 180 

cctccagtca ctgctggcgc tgagcattta aagctttatg agaaaagtgt cgtgatacct 240 

ggcccggtta cccaagcgat tgccgagcga gccagggtta atcagatggt tgtcgtgctt 300 

ggtgtcaacg agcgagataa cggtagcctc tacaacacac aattgatctt tgataccaac 360 

ggcgaactgc tacttaaaag aagaaaaatc acccctacct accatgaacg tatgatctgg 420 

gggcaaggtg atgcatcagg tctcaaagta gttgaaacag agatcgcccg agtaggtgcc 480 

ttggcttgtt gggaacacta caacccactg gccagatatg cactcatgac acagcatgaa 540 

gaaattcact gtgcgcaatt cccaggctct atggttggcc agatatttgc cgatcagatg 600 

gatgtcacta tccgacatca cgccttagag tcaggctgct tcgtcatcaa cgccactggc 660 

tggctcaccg acgcgcaaat ccaatcgatt actgatgacc caaaaatgca aaaagcatta 720 

cgtggcggct gcaacacagc catcatctcc cccgaagggg tgcacttaac agagccacta 780 

cgtgaaggag aaggcatggt gattgccaat cttgatatgg cactcatcac aaaacgaaaa 840 

agaatgatgg attcggtagg ccattattca agaccagaat tattaagcct ggcaattaac 900 

gacaaaccag caactacaac attttcaatg actgaggggc gtactcaaac agagccattt 960 

cgaatcgcag aggagttgaa aaatgacgac aagcttagca ctggaaacta a 1011 

<210> 136 
<211> 336 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 136 

Met Ser Asp Lys Arg He He Lys Ala Ala Ala Val Gin He Thr Pro 
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1 






5 




Asp 


Phe 


Asp 


Ser Ala Asp Gly 


Thr 








20 




Asp 


Glu 


Ala 


Gly Ala Lys Gly 


Val 






35 




40 


Phe 


lie 


Pro 


Tyr Tyr Pro Tyr 


Phe 




50 




55 




Ala Gly 


Ala 


Glu His Leu Lys 


Leu 


65 






70 




Gly 


Pro 


Val 


Thr Gin Ala He 


Ala 








85 




Val 


Val 


Val 


Leu Gly Val Asn 


Glu 








100 




Thr 


Gin 


Leu 


He Phe Asp Thr 


Asn 






115 




120 


Lys 


lie 


Thr 


Pro Thr Tyr His 


Glu 




130 




135 




Ala 


Ser 


Gly 


Leu Lys Val Val 


Glu 


145 






150 




Leu 


Ala 


Cys 


Trp Glu His Tyr 


Asn 








165 




Thr 


Gin 


His 


Glu Glu He His 


Cys 








180 




Gly Gin 


He 


Phe Ala Asp Gin 


Met 






195 




200 


Leu 


Glu 


Ser 


Gly Cys Phe Val 


He 




210 




215 




Ala 


Gin 


He 


Gin Ser He Thr 


Asp 


225 






230 




Arg Gly 


Gly 


Cys Asn Thr Ala 


He 








245 




Thr 


Glu 


Pro 


Leu Arg Glu Gly 


Glu 








260 




Met 


Ala 


Leu 


He Thr Lys Arg 


Lys 






275 




280 


Tyr 


Ser 


Arg 


Pro Glu Leu Leu 


Ser 




290 




295 




Thr 


Thr 


Thr 


Phe Ser Met Thr 


Glu 


305 






310 




Arg 


He 


Ala 


Glu Glu Leu Lys 


Asn 



325 



10 15 



Val Lys Lys Val 


Cys Lys Val 


He 


25 


30 




Gin He He Val 


Phe Pro Glu 


Thr 




45 




Ser Phe He Thr 


Pro Pro Val 


Thr 


60 






Tyr Glu Lys Ser 


Val Val He 


Pro 


75 




80 


Glu Arg Ala Arg 


Val Asn Gin 


Met 


90 


95 




Arg Asp Asn Gly 


Ser Leu Tyr 


Asn 


105 


110 




Gly Glu Leu Leu 


Leu Lys Arg 


Arg 




125 




Arg Met He Trp 


Gly Gin Gly 


Asp 


140 






Thr Glu He Ala 


Arg Val Gly 


Ala 


155 




160 


Pro Leu Ala Arg 


Tyr Ala Leu 


Met 


170 • 


175 




Ala Gin Phe Pro 


Gly Ser Met 


Val 


185 


190 




Asp Val Thr He 


Arg His His 


Ala 




205 




Asn Ala Thr Gly 


Trp Leu Thr 


Asp 


220 






Asp Pro Lys Met 


Gin Lys Ala 


Leu 


235 




240 


He Ser Pro Glu 


Gly Val His 


Leu 


250 


255 




Gly Met Val He 


Ala Asn Leu 


Asp 


265 


270 




Arg Met Met Asp 


Ser Val Gly 


His 




285 




Leu Ala He Asn 


Asp Lys Pro 


Ala 


300 






Gly Arg Thr Gin 


Thr Glu Pro 


Phe 


315 




320 


Asp Asp Lys Leu 


Ser Thr Gly 


Asn 


330 


335 





<210> 137 
<211> 978 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 137 

atggctattg tcaaggccgc ggcggtgcag atcagtccgg tgctctacag tcgcgccggc 60 

acagtggaca aggtcgtcgc gaagatccgc gagctgggcc gacgaggggt cgagttcgcc 120 

gtcttccccg agaccgtcat tccctactat ccctacttct ctttcgtgca gcccccctac 180 

acccaggcca ccgaacacct gcgcctgctc gaggaatcgg tgaccgtgcc ctccgccgaa 240 

accgacgcga tcgccaaggc cgctcgcgag gcgggcatgg tcgtctccat cggcgtcaac 300 

gagcgcgacg gcggaaccat ctacaacacc caactcctct tcgacgccga cggcactctc 360 

atccagcgcc gccgcaagat cacccccacc tatcacgaac gcatggtctg ggggcaggga 420 

gacggctcag gtctgcgcgc cgtcgacagt gcggtcggcc gcatcggcca gctcgcctgc 4 80 

tgggagcact accagccact ggcccggtac gccctcatcg ctgacggcga gcagatccac 540 

gccgcgatgt accccggcgc cttcggcggc gatctgttcg ccgagcagat cgaggtcaac 600 
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atccgccagc acgccctgga atccgccagc ttcgtcgtca acgccaccgc ctggctcgac 660 

gccgatcagc aggcccagat cgccaaggac accggaggcc cggtcccggc cttctccggt 720 

ggcttcttca ccgccatcgt cgaccccgaa ggccgtatca tcggcgaccc cctcaccagc 780 

ggcgaaggcg aagtgatcgc cgacctcgat ctcgctctca tcaaccgccg caagcgcctc 84 0 

atggacgcca gtggacacta ccagccgccc gaaattctta gcttcacatt gaccggtgca 900 

ccggcgcctt atgtcaagag cgcggcgtgc cggggaaccc cgggtacgac cgtggccgag 960 

gagggacggt ccgcttag 978 

<210> 138 
<211> 325 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 138 

Met Ala lie Val Lys Ala Ala Ala Val Gin He Ser Pro Val Leu Tyr 

15 10 15 

Ser Arg Ala Gly Thr Val Asp Lys Val Val Ala Lys He Arg Glu Leu 

20 25 30 

Gly Arg Arg Gly Val Glu Phe Ala Val Phe Pro Glu Thr Val He Pro 

35 40 45 

Tyr Tyr Pro Tyr Phe Ser Phe Val Gin Pro Pro Tyr Thr Gin Ala Thr 

50 55 60 

Glu His Leu Arg Leu Leu Glu Glu Ser Val Thr Val Pro Ser Ala Glu 
65 70 75 80 

Thr Asp Ala He Ala Lys Ala Ala Arg Glu Ala Gly Met Val Val Ser 

85 90 95 

He Gly Val Asn Glu Arg Asp Gly Gly Thr He Tyr Asn Thr Gin Leu 

100 105 110 

Leu Phe Asp Ala Asp Gly Thr Leu He Gin Arg Arg Arg Lys He Thr 

115 120 125 

Pro Thr Tyr His Glu Arg Met Val Trp Gly Gin Gly Asp Gly Ser Gly 

130 135 140 

Leu Arg Ala Val Asp Ser Ala Val Gly Arg He Gly Gin Leu Ala Cys 
145 150 155 160 

Trp Glu His Tyr Gin Pro Leu Ala Arg Tyr Ala Leu He Ala Asp Gly 

165 170 175 

Glu Gin He His Ala Ala Met Tyr Pro Gly Ala Phe Gly Gly Asp Leu 

180 185 190 

Phe Ala Glu Gin He Glu Val Asn He Arg Gin His Ala Leu Glu Ser 

195 200 205 

Ala Ser Phe Val Val Asn Ala Thr Ala Trp Leu Asp Ala Asp Gin Gin 

' 210 215 220 

Ala Gin He Ala Lys Asp Thr Gly Gly Pro Val Pro Ala Phe Ser Gly 
225 230 235 240 

Gly Phe Phe Thr Ala He Val Asp Pro Glu Gly Arg He He Gly Asp 

245 250 255 

Pro Leu Thr Ser Gly Glu Gly Glu Val He Ala Asp Leu Asp Leu Ala 

260 265 270 

Leu lie Asn Arg Arg Lys Arg Leu Met Asp Ala Ser Gly His Tyr Gin 

275 280 285 

Pro Pro Glu He Leu Ser Phe Thr Leu Thr Gly Ala Pro Ala Pro Tyr 

290 295 300 

Val Lys Ser Ala Ala Cys Arg Gly Thr Pro Gly Thr Thr Val Ala Glu 
305 310 315 320 

Glu Gly Arg Ser Ala 
325 

<210> 139 
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<211> 999 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 139 

atgaaaacaa cggttaccgt tgcctgcgtt caggccgccc ccgtatttat ggatttagaa 60 

ggcaccgtag ataaaacaat caccctcatc tctgaagccg cacagaaagg cgcggagctc 120 

atcgcttttc cggagacctg gatacccggt tacccgtggt tcttatggct gaactcgccc 180 

gccacaaata tgcccctggt ttatcagtat catcagaact ctctggtgct ggacagtacc 240 

caggcgaagc gaattgcgga tgcggcacgg cagaataaca tcactgtcgc tctgggcttc 300 

agcgaacgcg atcatggaag cctctatatc gcacagtggc tgattggcag cgacggggag 360 

accattggca tccggcgcaa gctcaaggcc acgcacgtgg agcgtacgct gttcggcgaa 420 

agcgacggct cctccctgac cacctgggag acacctctgg gtaacgtcgg ggccctctgc 4 80 

tgctgggagc acctgcagcc gctgtcccgc tatgcaatgt attcccagca tgaggagatc 540 

cacatcgctg cctggcccag tttcagtctc tacaccagtg caacggccgc actgggtcct 600 

gacgtcaata cggcggcttc acgcctctat gccgcggagg ggcagtgctt cgtgatagcc 660 

ccgtgtgccg tggtttctga tgaaatgatt gatttactct gtcctgatga tgaccggaga 720 

gcgttactca gtgccggagg gggacatgcc cgtatttacg gcccggacgg aagagaactc 780 

gtcacccctc tcggggaaaa tgaggaagga ctgcttatcg ctgagctcga ctctgctgcg 840 

attacctttg ccaaactggc ggcagacccg gttggccact attcccgtcc tgacgtgacc 900 

cgcctccttt ttaatccttc agccaacaag actgtgatta aacgacattc gcctcctgag 960 

ttaattgccg agcagactgc agaagaagag gaggagtag 999 

<210> 140 
<211> 332 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 140 



Met 


Lys 


Thr 


Thr 


Val 


Thr 


Val 


Ala 


Cys Val 


Gin 


Ala 


Ala Pro Val 


Phe 


1 






5 








10* 






15 




Met 


Asp 


Leu 


Glu 


Gly 


Thr 


Val 


Asp 


Lys Thr 


He 


Thr 


Leu He Ser 


Glu 






20 










25 






30 




Ala 


Ala 


Gin 


Lys 


Gly 


Ala 


Glu 


Leu 


He Ala 


Phe 


Pro 


Glu Thr Trp 


He 






35 










40 








45 




Pro 


Gly 


Tyr 


Pro 


Trp 


Phe 


Leu 


Trp 


Leu Asn 


Ser 


Pro 


Ala Thr Asn 


Met 




50 










55 








60 






Pro 


Leu 


Val 


Tyr 


Gin 


Tyr 


His 


Gin 


Asn Ser 


Leu 


Val 


Leu Asp Ser 


Thr 


65 








70 








75 






80 


Gin 


Ala 


Lys 


Arg 


He 


Ala 


Asp 


Ala 


Ala Arg 


Gin 


Asn 


Asn He Thr 


Val 






85 








90 






95 




Ala 


Leu 


Gly 


Phe 


Ser 


Glu 


Arg 


Asp 


His Gly 


Ser 


Leu 


Tyr He Ala 


Gin 








100 










105 






110 




Trp 


Leu 


He 


Gly 


Ser 


Asp 


Gly 


Glu 


Thr He 


Gly 


He 


Arg Arg Lys 


Leu 






115 










120 








125 




Lys 


Ala 


Thr 


His 


Val 


Glu 


Arg 


Thr 


Leu Phe 


Gly Glu 


Ser Asp Gly 


Ser 




130 










135 








140 






Ser 


Leu 


Thr 


Thr 


Trp 


Glu 


Thr 


Pro 


Leu Gly 


Asn 


Val 


Gly Ala Leu 


Cys 


145 








150 








155 






160 


Cys 


Trp 


Glu 


His 


Leu 


Gin 


Pro 


Leu 


Ser Arg 


Tyr Ala Met Tyr Ser 


Gin 








165 








170 






175 




His 


Glu 


Glu 


He 


His 


He 


Ala 


Ala 


Trp Pro 


Ser 


Phe 


Ser Leu Tyr 


Thr 








180 










185 






190 




Ser 


Ala 


Thr 


Ala 


Ala 


Leu 


Gly 


Pro 


Asp Val 


Asn 


Thr 


Ala Ala Ser 


Arg 



195 200 205 
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Leu Tyr Ala Ala Glu Gly Gin Cys Phe Val He Ala Pro Cys Ala Val 

2io 215 220 

Val Ser Asp Glu Met He Asp Leu Leu Cys Pro Asp Asp Asp Arg Arg 
225 230 235 240 

Ala Leu Leu Ser Ala Gly Gly Gly His Ala Arg He Tyr Gly Pro Asp 

245 250 255 

Gly Arg Glu Leu Val Thr Pro Leu Gly Glu Asn Glu Glu Gly Leu Leu 

260 • 265 270 

He Ala Glu Leu Asp Ser Ala Ala He Thr Phe Ala Lys Leu Ala Ala 

275 280 285 

Asp Pro Val Gly His Tyr Ser Arg Pro Asp Val Thr Arg Leu Leu Phe 

290 295 300 

Asn Pro Ser Ala Asn Lys Thr Val He Lys Arg His Ser Pro Pro Glu 
305 310 315 320 

Leu lie Ala Glu Gin Thr Ala Glu Glu Glu Glu Glu 
325 330 

<210> 141 
<211> 1026 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 141 . 

atggtgttca aggcagcgac tgttcatgca gctccggtat tcatggacaa ggaagcgtcg 60 

atagataagg ctatcgacct catcaagaag gccggtcagg aagggattaa gcttctggtt 120 

tttccggaaa cgtttattcc gggctatccg tattttatcg aatgctatcc gccgcttgcg 180 

caggtggaag cgctcgccca gtacactgac gcttccgtgg agatcgacgg cccggaagtc 240 

acccggcttc agcaggtagc caaggcggca ggcgttgcag tcgtcatggg catcagcgaa 300 

cgaatggctg agacccgaac ctgcttcaac tcgcaggtgt tcattgacgt cgacggcacg 360 

ctgctcggcg tgcatcgcaa gctgcagccg acttatgccg agcgcaaggt atgggcacag 420 

ggcggtggtt atacgctgag gacctacaag agctcgcttg gcgtgctcgg cggtcttgcc 480 

tgctgggagc acacgatgaa cctcgcgcgg caggccctga tcatgcagag cgagcagatc 540 

catgcggctg catggcccgg actatcgacg atgcgaggtt tcgagcccgt ggccgatatc 600 

cagatcgacg ccatgatgaa gactcacgcg cttaccgcac agtgctgggt gctttcggcc 660 

ggcaatcccg tcgaccggac ctgcctcgac tggatggaaa agaacatcgg accgcaggat 720 

tacgtcaccg agggcggcgg atggagcgcc gttatccatc cgttcaacag ctatctcggc 780 

ggccctcaca cgggccttga ggaaaagctg gtcgtcggcg agatcaatct ggacgatctc 840 

aagttcgtca aagtctggct cgacagcaaa gggcactatg ctcggccgga aatcctgaaa 900 

cttggcgtca accaaaagca gatttggcct gatgaacatt tgctggcgcg gcaggatgtg 960 

accgagttgc tggaggcgga tatcatcgaa taccccttgc aactgttgca agaccgcgcg 1020 
caatag 

<210> 142 
<211> 341 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 

<400> 142 M fc _ 

Met Val Phe Lys Ala Ala Thr Val His Ala Ala Pro Val Phe Met Asp 

15 10 15 

Lys Glu Ala Ser He Asp Lys Ala He Asp Leu He Lys Lys Ala Gly 

20 25 30 

Gin Glu Gly He Lys Leu Leu Val Phe Pro Glu Thr Phe He Pro Gly 

35 40 45 

Tyr Pro Tyr Phe He Glu Cys Tyr Pro Pro Leu Ala Gin Val Glu Ala 



1020 
1026 
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50 


55 


60 




Leu 


Ala 


Gin Tyr Thr Asp Ala 


Ser Val Glu He Asp 


Gly Pro Glu Val 


65 




70 


75 


80 


Thr 


Arg 


Leu Gin Gin Val Ala 


Lys Ala Ala Gly Val 


Ala Val Val Met 




85 


90 


95 


Gly 


He 


Ser Glu Arg Met Ala 


Glu Thr Arg Thr Cys 


Phe Asn Ser Gin 




100 


105 


110 


Val 


Phe 


He Asp Val Asp Gly 


Thr Leu Leu Gly Val 


His Arg Lys Leu 






115 


120 


125 


Gin 


Pro 


Thr Tyr Ala Glu Arg 


Lys Val Trp Ala Gin 


Gly Gly Gly Tyr 




130 


135 


140 




Thr 


Leu 


Arg Thr Tyr Lys Ser 


Ser Leu Gly Val Leu 


Gly Gly Leu Ala 


145 




150 


155 


160 


Cys 


Trp 


Glu His Thr Met Asn 


Leu Ala Arg Gin Ala 


Leu He Met Gin 


165 


170 


175 


Ser 


Glu 


Gin He His Ala Ala 


Ala Trp Pro Gly Leu 


Ser Thr Met Arg 






180 


185 


190 


Gly Phe 


Glu Pro Val Ala Asp 


He Gin He Asp Ala 


Met Met Lys Thr 






195 


200 


205 


His 


Ala 


Leu Thr Ala Gin Cys 


Trp Val Leu Ser Ala 


Gly Asn Pro Val 




210 


215 


220 




Asp Arg 


Thr Cys Leu Asp Trp 


Met Glu Lys Asn He 


Gly Pro Gin Asp 


225 




230 


235 


240 


Tyr Val 


Thr Glu Gly Gly Gly 


Trp Ser Ala Val He 


His Pro Phe Asn 






245 


250 


255 


Ser 


Tyr 


Leu Gly Gly Pro His 


Thr Gly Leu Glu Glu 


Lys Leu Val Val 






260 


265 


270 


Gly Glu 


He Asn Leu Asp Asp 


Leu Lys Phe Val Lys 


Val Trp Leu Asp 






275 


280 


285 


Ser 


Lys 


Gly His Tyr Ala Arg 


Pro Glu lie Leu Lys 


Leu Gly Val Asn 




290 


295 


300 




Gin 


Lys 


Gin He Trp Pro Asp 


Glu His Leu Leu Ala 


Arg Gin Asp Val 


305 




310 


315 


320 


Thr 


Glu 


Leu Leu Glu Ala Asp 


He He Glu Tyr Pro 


Leu Gin Leu Leu 






325 


330 


335 


Gin Asp 


Arg Ala Gin 







340 



<210> 143 
<211> 1122 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 143 

atgacgatca ttgcaggcgc ggttcatgcg gcgccggtat tcatggatgt cgatgccact 60 

atcgacaagg catgcgaaat cattcgcaag gcaggcaaag acggaatcga gcttctcgtc 120 

ttccctgagg ttttcgtacc cggctacccc tacttcatcg agtgctatcc gaccttgaac 180 

caaaccgctg cgctggccgc ctatacggat gcctcgatcg aggttccagg cccggaagtc 240 

cggcgcttgc aggtggccgc acatcaggcc ggcgtgatgg ttgtgatggg cgtgagcgag 300 

cgtctgcgcg gatctcgcac ctgcttcaac agccaggtgt tcatcgaccg tgacggcacc 360 

ttgctgggcg tgcaccgcaa actccagccg acctatgtcg agcgcatcgt ctggggccag 420 

ggcggcggac acaccctcaa ggtattcgac agcacactgg gcaaggtggg cggactggcc 480 

tgctgggagc acacgatgaa cctcgcgcgc catgcgttga tcgcccaggg tatccagatc 540 

catgccgccg cctggcctgg gctttcgaca atggccgggt tcgaagcggt ggctgacgtc 600 

cagatcgacg cgatgatgaa aactcatgcg ttgagcgcgc aatgctttgt cgtatcggcc 660 

gcaaaccctg tggatcagac ctgcctggag tggatggaga aacacctcgg cccgcagcaa 720 

ctcgttaccg ccggcggagg ctggtcggca atcgtccatc ctttctgtgg ttatatcgcc 780 

gcccctcaca ccggtgccga ggagaaggtt ctggtaggcg aaatcaatct ggacgacctc 840 
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aagcaggtca aggtatgggt tgattccgca ggtcattatg cgcgcccgga agtcgtgcaa 900 

ttgcgcgacg ccctggagag ccgtggcaat tatcgcgttg cgctgacccg cgacgccgac 960 

accttcgtgc cgctggaaga ccgcgtgcgc tttgcgcgcc agcagaacgc cgacctcttc 1020 

atctcgatcc acgccgacgc caacgccaac cacgatgcgc gcggggctgg cttcacttcg 1080 

aaggttgaaa acctttccac gggcatttta ccaggcgatt ga 1122 

<210> 144 
<211> 373 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 144 

Met Thr He He Ala Gly Ala Val His Ala Ala Pro Val Phe Met Asp 

15 10 15 

Val Asp Ala Thr He Asp Lys Ala Cys Glu He He Arg Lys Ala Gly 

20 25 30 

Lys Asp Gly He Glu Leu Leu Val Phe Pro Glu Val Phe Val Pro Gly 

35 40 .45 

Tyr Pro Tyr Phe He Glu Cys Tyr Pro Thr Leu Asn Gin Thr Ala Ala 

50 55 60 

Leu Ala Ala Tyr Thr Asp Ala Ser He Glu Val Pro Gly Pro Glu Val 
65 70 75 80 

Arg Arg Leu Gin Val Ala Ala His Gin Ala Gly Val Met Val Val Met 

85 90 95 

Gly Val Ser Glu Arg Leu Arg Gly Ser Arg Thr Cys Phe Asn Ser Gin 

100 105 HO 

Val Phe He Asp Arg Asp Gly Thr Leu Leu Gly Val His Arg Lys Leu 

115 120 125 

Gin Pro Thr Tyr Val Glu Arg He Val Trp Gly Gin Gly Gly Gly His 

130 135 140 

Thr Leu Lys Val Phe Asp Ser Thr Leu Gly Lys Val Gly Gly Leu Ala 
145 150 i55 160 

Cys Trp Glu His Thr Met Asn Leu Ala Arg His Ala Leu He Ala Gin 

165 170 175 

Gly He Gin He His Ala Ala Ala Trp Pro Gly Leu Ser Thr Met Ala 

180 185 190 

Gly Phe Glu Ala Val Ala Asp Val Gin He Asp Ala Met Met Lys Thr 

195 200 205 

His Ala Leu Ser Ala Gin Cys Phe Val Val Ser Ala Ala Asn Pro Val 

210 215 220 

Asp Gin Thr Cys Leu Glu Trp Met Glu Lys His Leu Gly Pro Gin Gin 
225 230 235 240 

Leu Val Thr Ala Gly Gly Gly Trp Ser Ala He Val His Pro Phe Cys 

245 250 255 

Gly Tyr He Ala Ala Pro His Thr Gly Ala Glu Glu Lys Val Leu Val 

260 265 270 

Gly Glu He Asn Leu Asp Asp Leu Lys Gin Val Lys Val Trp Val Asp 

275 280 285 

Ser Ala Gly His Tyr Ala Arg Pro Glu Val Val Gin Leu Arg Asp Ala 

290 295' 300 

Leu Glu Ser Arg Gly Asn Tyr Arg Val Ala Leu Thr Arg Asp Ala Asp 
305 310 315 320 

Thr Phe Val Pro Leu Glu Asp Arg Val Arg Phe Ala Arg Gin Gin Asn 

325 330 335 

Ala Asp Leu Phe He Ser He His Ala Asp Ala Asn Ala Asn His Asp 

340 345 350 

Ala Arg Gly Ala Gly Phe Thr Ser Lys Val Glu Asn Leu Ser Thr Gly 
355 360 365 
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lie Leu Pro Gly Asp 
370 



<210> 145 
<211> 1014 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 145 

atgggcatca cccacccgaa ctacaaggtc gcagtggtcc aggctgcgcc ggtctggttg 60 

aacctcgagg caacggtcga gaagacaatc aggtatattg aagaggcggc caaggctgga 120 

gcgaagctga tagcgtttcc ggaaacctgg attccgggct atccatggca catttggatc 180 

ggaacgcccg catgggcaat cggtaagggc ttcgtccagc gctatttcga caactcgctc 240 

agctatgaca gcccgctcgc gcggcagatc gctgacgccg cagcaaagag caagatcacg 300 

gttgttctcg gcctctccga gcgcgacggt ggaagcctat acatcgcgca atggctgatc 360 

ggaccagatg gcgagaccat cgcgaagcgg cgcaagctgc gtccgaccca cgtcgagcgc 420 

acggtgttcg gtgacggtga cggcagccac atcgccgtgc atgaccgatc cgatctgggc 480 

cggctcgggg cgttgtgctg ctgggagcac gtgcagccgt tgacgaaatt cgcgatgtac 540 

gcgcagaacg agcaggttca cgtggcagca tggccgagct tctcgatgta cgaacccttt 600 

gcgcatgcgc tgggttggga gacgaacaac gcggtcagca aggtctacgc ggtcgaggga 660 

tcgtgcttcg tgctcgctcc ctgtgccgtt atttcgcaag cgatggtgga cgagatgtgc 720 

gacactcccg acaagcgcga gcttgttcac gccggcggcg gccacgcggt gatttacggc 780 

cctgacggaa gcccgctcgc agaaaagctc ggggaaaacg aagaggggct tctctacgcg 840 

acggtcaatc ttgctgcgat cggggttgcc aagaatgccg cggatccggc cgggcactat 900 

tcgcgtccgg acgttctaag gctgctattc aacaagagcc cggcccgaag agtggagcat 960 

tttgcgctgc cgcacgagca gctcgagatc ggggcaggcc cgtctggcga ctga 1014 

<210> 146 
<211> 337 
<212> PRT 
<213> Unknown 



<220> 

<223> "Obtained from an environmental sample 



<400> 146 



Met 


Gly 


He 


Thr 


His 


Pro 


Asn 


Tyr 


1 








5 








Pro 


Val 


Trp 


Leu 
20 


Asn 


Leu 


Glu 


Ala 


lie 


Glu 


Glu 
35 


Ala 


Ala 


Lys 


Ala 


Gly 
40 


Thr 


Trp 
50 


He 


Pro 


Gly 


Tyr 


Pro 
55 


Trp 


Trp 


Ala 


He 


Gly 


Lys 


Gly 


Phe 


Val 


65 










70 






Ser 


Tyr 


Asp 


Ser 


Pro 
85 


Leu 


Ala 


Arg 


Ser 


Lys 


He 


Thr 
100 


Val 


Val 


Leu 


Gly 


Leu 


Tyr 


He 


Ala 


Gin 


Trp 


Leu 


He 




115 










120 


Lys 


Arg 
130 


Arg 


Lys 


Leu 


Arg 


Pro 
135 


Thr 


Asp 


Gly 


Asp 


Gly 


Ser 


His 


He 


Ala 


145 










150 






Arg 


Leu 


Gly 


Ala 


Leu 


Cys 


Cys 


Trp 



165 



Lys 


Val 


Ala Val Val 


Gin 


Ala 


Ala 


10 






15 




Thr 


Val 


Glu Lys Thr 


He Arg 


Tyr 


25 






30 






Ala 


Lys 


Leu He Ala 


Phe 


Pro 


Glu 




45 








His 


He 


Trp He Gly 


Thr 


Pro 


Ala 






60 








Gin Arg 


Tyr Phe Asp 


Asn 


Ser 


Leu 






75 






80 


Gin 


He 


Ala Asp Ala 


Ala 


Ala 


Lys 




90 






95 




Leu 


Ser 


Glu Arg Asp 


Gly Gly 


Ser 


105 






110 






Gly Pro 


Asp Gly Glu 


Thr 


He 


Ala 






125 








His 


Val 


Glu Arg Thr 


Val 


Phe 


Gly 






140 








Val 


His 


Asp Arg Ser 


Asp 


Leu 


Gly 






155 






160 


Glu 


His 


Val Gin Pro 


Leu 


Thr 


Lys 




170 






175 
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Phe 


Ala 


Met 


Tvr 


Ala 


Gin 


Asn 


Glu 


Gin 


Val 


His Val Ala Ala 


Trp 


Pro 








180 










185 




190 






Ser 


Phe 


Ser 


Met 


Tvr 


Glu 


Pro 


Phe 


Ala 


His 


Ala Leu Gly Trp 


Glu 


Thr 






195 










200 






205 






Asn 


Asn 


Ala 


Val 


Ser 


Lvs 


Val 


Tvr 


Ala 


Val 


Glu Gly Ser Cys 


Phe 


Val 




210 










215 








220 






Li&U 


Ala 


Pro 


Cys 


Ala 


Val 


lie 


Ser 


Gin 


Ala 


Met Val Asp Glu 


Met 


Cvs 


225 








230 










235 




240 


Asp 


Thr 


Pro 


Asp 


Lys 


Ara 


Glu 


Leu 


Val 


His 


Ala Gly Gly Gly 


His 


Ala 








245 










250 




255 




Val 


lie 


Tyr 


Gly 


Pro 


Asp 


Gly 


Ser 


Pro 


Leu 


Ala Glu Lys Leu 


Gly 


Glu 






260 










265 




270 






Asn 


Glu 


Glu Gly 


Leu 


Leu 


Tyr 


Ala 


Thr 


Val 


Asn Leu Ala Ala 


He 


Gly 






275 










280 






285 






Val 


Ala 


Lys 


Asn 


Ala 


Ala 


Asp 


Pro 


Ala 


Gly 


His Tyr Ser Arg 


Pro 


Asp 




290 










295 








300 






Val 


Leu 


Arg 


Leu 


Leu 


Phe 


Asn 


Lys 


Ser 


Pro 


Ala Arg Arg Val 


Glu 


His 


305 










310 










315 




320 


Phe 


Ala 


Leu 


Pro 


His 


Glu 


Gin 


Leu 


Glu 


He 


Gly Ala Gly Pro 


Ser 


Gly 










325 










330 




335 




Asp 



























<210> 147 
<211> 1098 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 147 

atgacccagc acgagaccac tgcccggagg ctggcagctg tgcatgccgc gcctgtgttc 60 

atggacaccg acgcgaccat cgacaaggtg atcggcttcg tcgaacaggc cggccgcgaa 120 

ggcatcgaac tcctggtgtt ccccgagacc ttcgtgcctg gttaccccta ctggatcgag 180 

tgctatccgc cgctgcagca ggtggccgcc aacgcgcagt acacggacgc ctccgtcgag 240 

gtgcctggtc cggagatcaa gcgggtgcag gcggcctgtg cccgcgctgg cgtcgaagtc 300 

gtcctcggcg tcagcgagcg actcaggggt accaggacat gcttcaactc ccaggtgttc 360 

atcgacgccg acgggagcct gctcggcgtg caccgcaagc tgcagccgac gtacgtggag 420 

cgcatcgtgt gggcccaggg cggaggcgcg accctgtcgg tgttcggctc ccgctccggc 480 

cggatcggcg gtctggcctg ctgggagcac acgatgaacc tggctcgtca ggcactgctt 540 

gagcaggagc agcagatcca cgcggcggcg tggcctgccc tgtcgacgat ggcggggttc 600 

gagaccgtcg cggacgccca gatcgaggcc atgatgaaga cccatgcgct cacggcacag 660 

gtgttcgtca tctgcgcgtc caacccggtc gacggcactt gcctggaatg gatgcgggac 720 

aacctcggtg aacagaagtt cgtgaccgcc ggagggggct ggtccgcggt catccacccc 780 

ttcaactcct tcctcggcgg gccgcatacc ggtttggagg agaagctcgt cagcgcgacg 840 

atcgacttct ccgacatccg cttggtcaag gcctgggttg attcgaaggg gcactacgcg 900 

cggcccgagg tcctgcgact cgcggtcgac cgcaagccac tgtggcacga cgagtgcgag 960 

gtgccgggac aggcgcaggt acgcacccgc gctgcttctc tggcagtgca ggagcacccg 1020 

gtggtgctgc ctcagggggc ggcgcggccc gctccgcaag actgggacac ctctgcggcg 1080 

caggagctga cttcctga 1098 

<210> 148 
<211> 365 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 148 
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Met Thr Gin His Glu Thr Thr Ala Arg Arg Leu Ala Ala Val His Ala 

1 5 10 15 

Ala Pro Val Phe Met Asp Thr Asp Ala Thr He Asp Lys Val He Gly 

20 25 30 

Phe Val Glu Gin Ala Gly Arg Glu Gly He Glu Leu Leu Val Phe Pro 

35 40 45 

Glu Thr Phe Val Pro Gly Tyr Pro Tyr Trp He Glu Cys Tyr Pro Pro 

50 55 60 

Leu Gin Gin Val Ala Ala Asn Ala Gin Tyr Thr Asp Ala Ser Val Glu 
65 70 75 80 

Val Pro Gly Pro Glu He Lys Arg Val Gin Ala Ala Cys Ala Arg Ala 

85 90 95 

Gly Val Glu Val Val Leu Gly Val Ser Glu Arg Leu Arg Gly Thr Arg 

100 105 HO 

Thr Cys Phe Asn Ser Gin Val Phe He Asp Ala Asp Gly Ser Leu Leu 

115 120 125 

Gly Val His Arg Lys Leu Gin Pro Thr Tyr Val Glu Arg He Val Trp 

130 135 140 

Ala Gin Gly Gly Gly Ala Thr Leu Ser Val Phe Gly Ser Arg Ser Gly 
145 150 155 160 

Arq He Gly Gly Leu Ala Cys Trp Glu His Thr Met Asn Leu Ala Arg 

165 170 175 

Gin Ala Leu Leu Glu Gin Glu Gin Gin He His Ala Ala Ala Trp Pro 

180 185 190 

Ala Leu Ser Thr Met Ala Gly Phe Glu Thr Val Ala Asp Ala Gin He 

195 200 205 

Glu Ala Met Met Lys Thr His Ala Leu Thr Ala Gin Val Phe Val He 

210 215 220 

Cys Ala Ser Asn Pro Val Asp Gly Thr Cys Leu Glu Trp Met Arg Asp 
225 230 235 240 

Asn Leu Gly Glu Gin Lys Phe Val Thr Ala Gly Gly Gly Trp Ser Ala 

245 250 255 

Val He His Pro Phe Asn Ser Phe Leu Gly Gly Pro His Thr Gly Leu 

260 265 270 

Glu Glu Lys Leu Val Ser Ala Thr He Asp Phe Ser Asp He Arg Leu 

275 280 285 

Val Lys Ala Trp Val Asp Ser Lys Gly His Tyr Ala Arg Pro Glu Val 

290 295 300 

Leu Arg Leu Ala Val Asp Arg Lys Pro Leu Trp His Asp Glu Cys Glu 
305 310 315 320 

Val Pro Gly Gin Ala Gin Val Arg Thr Arg Ala Ala Ser Leu Ala Val 

325 330 335 

Gin Glu His Pro Val Val Leu Pro Gin Gly Ala Ala Arg Pro Ala Pro 

340 345 350 

Gin Asp Trp Asp Thr Ser Ala Ala Gin Glu Leu Thr Ser 
355 360 365 

<210> 149 
<211> 942 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 149 

atgacgaagc ttgagaaggt ggtcgcggcg gcggtccagg cgacgccgga gttcctcgac 
cgcgaggcga ccgtcgagaa ggccgtgcgg ctgatcaagg aagcggccgg ggagggcgcc 
ggcctgatcg tgttccccga gacgttcatc ccgacgtacc cggactgggt ctggcgcgcg 
ccggcctggg acggcccatc cgcggacctg tacgcaatgc tgctggagaa cgcggtggag 240 
atccccgggc cggtgacgga gaccctgggg aaggcggcga agcaggccaa ggccttcgtg 300 



60 
120 
180 



Page 98 



WO 03/097810 PCT/US03/15712 



tcgatgggcg tcaacgagcg cgagccgggc ggcgggacga tctacaacac gcaggtcacg 360 

ttcggacccg acgggagcgt gctcggcaag caccgcaagc tgatgccgac cggcggcgag 420 

cgcctggtgt gggggatggg cgacgggtcg atgctccagg tctatgacac gccgttcggc 480 

cgcctgggcg ggctgatctg ctgggagaac tacatgccgc tcgcgcgcta ctcgatgtac 540 

gccaagggcg tggacgtcta cgttgcgccg acgtgggaca acagcgacat gtgggtggcg 600 

acgctccgcc acatcgccaa ggaggggcgg ctgtacgtga tcggcgtggc gccgctgctg 660 

cgcgggtcgg acgtccccga cgacgtgccg gggaaggccg agctgtgggg cggcgatgac 720 

gactggatgt cgcgcggctt ctccaccatc gtcgcgccgg gcggcgaggt gctggccggt 780 

ccgctgacgg aggaggaagg catcctctac gcggagatcg acccggcgag agcccgttcg 840 

tcacggcacc agttcgatcc ggtggggcac tactcgcgcc ccgacgtgtt tcggctcgtc 900 

gtggacgagt cgcccaagcc ccagacgtcc ggcccgggct ag 942 

<210> 150 
<211> 313 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 150 



Met 


Thr 


Lys 


Leu Glu 


Lys 


Val Val 


Ala Ala Ala Val Gin Ala Thr Pro 


1 






5 






10 15 


Glu 


Phe 


Leu 


Asp Arg 


Glu 


Ala Thr 


Val Glu Lys Ala Val Arg Leu He 








20 






25 30 


Lys 


Glu 


Ala 


Ala Gly 


Glu 


Gly Ala 


Gly Leu He Val Phe Pro Glu Thr 




35 






40 


45 


Phe 


lie 


Pro 


Thr Tyr 


Pro 


Asp Trp 


Val Trp Arg Ala Pro Ala Trp Asp 




50 








55 


60 


Gly 


Pro 


Ser 


Ala Asp 


Leu 


Tyr Ala 


Met Leu Leu Glu Asn Ala Val Glu 


65 








70 




75 80 


lie 


Pro 


Gly 


Pro Val 


Thr 


Glu Thr 


Leu Gly Lys Ala Ala Lys Gin Ala 






85 






90 95 


Lys 


Ala 


Phe 


Val Ser 


Met 


Gly Val 


Asn Glu Arg Glu Pro Gly Gly Gly 






100 






105 110 


Thr 


He 


Tyr 


Asn Thr 


Gin 


Val Thr 


Phe Gly Pro Asp Gly Ser Val Leu 






115 






120 


125 


Gly 


Lys 


His 


Arg Lys 


Leu 


Met Pro 


Thr Gly Gly Glu Arg Leu Val Trp 




130 








135 


140 


Gly 


Met 


Gly 


Asp Gly 


Ser 


Met Leu 


Gin Val Tyr Asp Thr Pro Phe Gly 


145 








150 




155 160 


Arg 


Leu 


Gly 


Gly Leu 


He 


Cys Trp 


Glu Asn Tyr Met Pro Leu Ala Arg 






165 






170 175 


Tyr 


Ser 


Met 


Tyr Ala 


Lys 


Gly Val 


Asp Val Tyr Val Ala Pro Thr Trp 






180 




185 190 


Asp 


Asn 


Ser 


Asp Met 


Trp 


Val Ala 


Thr Leu Arg His He Ala Lys Glu 




195 






200 


205 


Gly 


Arg 


Leu 


Tyr Val 


He 


Gly Val 


Ala Pro Leu Leu Arg Gly Ser Asp 




210 








215 


220 


Val 


Pro 


Asp 


Asp Val 


Pro 


Gly Lys 


Ala Glu Leu Trp Gly Gly Asp Asp 


225 








230 




235 240 


Asp 


Trp 


Met 


Ser Arg 


Gly 


Phe Ser 


Thr He Val Ala Pro Gly Gly Glu 






245 






250 255 


Val 


Leu 


Ala 


Gly Pro 


Leu 


Thr Glu 


Glu Glu Gly lie Leu Tyr Ala Glu 








260 






265 270 


lie 


Asp 


Pro 


Ala Arg 


Ala 


Arg Ser 


Ser Arg His Gin Phe Asp Pro Val 






275 






280 


285 


Gly 


His 


Tyr 


Ser Arg 


Pro 


Asp Val 


Phe Arg Leu Val Val Asp Glu Ser 


290 








295 


300 


Pro 


Lys 


Pro 


Gin Thr 


Ser 


Gly Pro 


Gly 


305 








310 
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<210> 151 
<211> 993 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 151 

atgagagtcg 

acagtcgata 

acgttcccgg 

caaatccttg 

cttgccaccg 

gtcaacgagc 

acgttgattc 

cagggcgatg 

gcttgttttg 

atccactctg 

atcaacgtaa 

ctggatgccg 

tcgggcggtt 

cactcgggcg 

cacgtggtgg 

cggactccca 

gactcgcaga 



ttaaagccgc 
acgtcgtcaa 
aaaccgtggt 
ccggcggtga 
aagcgatcgg 
gtgacggggg 
aaagacgacg 
gctcaggttt 
agcactacaa 
cgatgttccc 
ggcagcatgc 
accaacaggc 
gcttcaccgc 
aaggcgtttg 
acacgcgcgg 
cggcacacat 
atgtattcac 



cgcggtccaa 
gaagatccac 
gccttactat 
gttcctaaag 
cgaggcctgc 
aactctgtac 
caagatcacg 
acgggcggtt 
cccgcttgcg 
gggctccatg 
aatggagtct 
acaaatcatg 
tatcgtgaca 
tattgccgac 
ccactacagc 
acacgaacgg 
cgctattgct 



ctgaaaccag 
gagctgggcc 
ccgtactttt 
ctgcttgatc 
aggcaagcgg 
aatacgcaac 
cccacccatt 
gacagcaagg 
cgttacgcca 
ttcggcgatg 
ggatgctttg 
aaggacacag 
cccgacggga 
ctcgatttca 
cggccagaat 
aacgagcaac 
taa 



tcctttatag 
aacaaggagt 
cgatcgtgca 
agtcagtgac 
gcgtcgttgt 
ttctctttga 
acgagcgcat 
tcgcgcgcat 
tgatggccga 
cgttttcaga 
tcgtctgcgc 
gctgcgagat 
cgctgatagg 
agctcatcga 
tgctcagcct 
cgaagtcggc 



ccgtgaggga 
gcagttcgca 
gtccggctat 
cgtgccatct 
ctccatcggc 
tgccgacggc 
ggtctggggc 
tggtcaactg 
tggcgagcaa 
gaaggtggaa 
tacggcctgg 
cggtccgatc 
cgaacccatc 
caagcggaag 
cctaattgat 
cgttgagcaa 



<210> 152 
<211> 330 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
993 



<400> 152 

Met Arg Val Val Lys Ala Ala Ala Val Gin Leu Lys Pro Val Leu Tyr 

15 10 15 

Ser Arg Glu Gly Thr Val Asp Asn Val Val Lys Lys He His Glu Leu 

20 25 30 

Gly Gin Gin Gly Val Gin Phe Ala Thr Phe Pro Glu Thr Val Val Pro 

35 40 45 

Tyr Tyr Pro Tyr Phe Ser He Val Gin Ser Gly Tyr Gin He Leu Ala 

50 55 60 

Gly Gly Glu Phe Leu Lys Leu Leu Asp Gin Ser Val Thr Val Pro Ser 
65 70 75 80 

Leu Ala Thr Glu Ala He Gly Glu Ala Cys Arg Gin Ala Gly Val Val 

85 90 95 

Val Ser He Gly Val Asn Glu Arg Asp Gly Gly Thr Leu Tyr Asn Thr 

100 ■ 105 HO 

Gin Leu Leu Phe Asp Ala Asp Gly Thr Leu He Gin Arg Arg Arg Lys 

115 120 125 

He Thr Pro Thr His Tyr Glu Arg Met Val Trp Gly Gin Gly Asp Gly 

130 135 140 

Ser Gly Leu Arg Ala Val Asp Ser Lys Val Ala Arg He Gly Gin Leu 
145 150 155 160 

Ala Cys Phe Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Met Met Ala 

165 170 175 

Asp Gly Glu Gin He His Ser Ala Met Phe Pro Gly Ser Met Phe Gly 
. 180 185 190 
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Asp Ala 


Phe 
195 


Ser 


Glu 


Lys 


Val 


Glu 
200 


He 


Asn 


Val 


Arg 


Gin 

205 


His Ala 


Met 


Glu Ser 


Gly 


Cys 


Phe 


Val 


Val 


Cys 


Ala 


Thr 


Ala 


Trp 


Leu 


Asp Ala 


Asp 


210 










215 










220 








Gin Gin 


Ala 


Gin 


He 


Met 


Lys 


Asp 


Thr 


Gly 


Cys 


Glu 


He 


Gly Pro 


He 


225 








230 










235 








240 


Ser Gly 


Gly 


Cys 


Phe 
245 


Thr 


Ala 


He 


Val 


Thr 
250 


Pro 


Asp 


Gly 


Thr Leu 
255 


He 


Gly Glu 


Pro 


He 
260 


His 


Ser 


Gly 


Glu 


Gly 
265 


Val 


Cys 


He 


Ala 


Asp Leu 
270 


Asp 


Phe Lys 


Leu 


He Asp 


Lys 


Arg 


Lys 


His 


Val 


Val 


Asp 


Thr 


Arg Gly 


His 




275 










280 










285 






Tyr Ser 


Arg 


Pro 


Glu 


Leu 


Leu 


Ser 


Leu 


Leu 


He 


Asp 


Arg 


Thr Pro 


Thr 


290 










295 










300 








Ala His 


lie 


His 


Glu 


Arg 


Asn 


Glu 


Gin 


Pro 


Lys 


Ser 


Ala 


Val Glu 


Gin 


305 








310 










315 








320 


Asp Ser 


Gin 


Asn 


Val 

325 


Phe 


Thr 


Ala 


He 


Ala 
330 













<210> 153 
<211> 1074 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 153 

atgccaaacg caagaaagat tgttggagcc gtggcccaag ttgcacagga attcttcgac 60 

actgaagcga atctcggtaa agcgatagcg gcgattcaca atgctgcgaa gcaaggcgca 120 

gatatcgtcg tcttcgccga atgctatttg ggccaatatc catattgggc gcaattttac 180 

gacaactctg ccaagaacta ttccaaggtt tggacggccc tgtacgacgg tgcgatcact 240 

gtgggtggcg atgaatgccg ggctattgct gctgcggcta gacagtccaa gattcatgtc 300 

gtcatgggtt gcaatgagct atccgaccga gccggcggcg caacgttata caacagcctc 360 

ttgtttttcg accgaaaggg cgagttgatc ggtcgacacc ggaaattgat gccgtcgatg 420 

cacgagcggt tgatccatgg cacaggcgac ggaagagact tgaatgttta cgataccgat 480 

atcggtatgt tgggtgggtt gatttgctgg gagcaccata tgtcgctctc gaagtatgcc 540 

atggcgacta tgggtgaaga agttcatgtt gcaagctggc ctgggatgtg gcgcggagga 600 

gacgcggcaa tcggtgagag gatggtcgaa gcggatcttg gggcgccgtt tgtttgtgac 660 

gccgaatttg cgatccgaga atatgcggca gagacaggaa atttcgttct aagcgcgtct 720 

ggatattttc cgaaggacaa tatatccgat gagtggcgcg aagcgattcc aaaccttcaa 780 

gcgcagtggg ctgtgggcgg gagttctatc gtggcaccgg ggggctccta tctggtccca 840 

ccactcatta atgaggagaa gatcctctgc gccgaactcg atttcaatct caggcgtctt 900 

tggaaagcct ggatcgatcc gattggtcac tattcgcgtc ccgatgttta tagcctgcaa 960 

ctgcataacg ttgctgggcg tgagtattcc tatcaggccg tagatttgaa gcgcacgcca 1020 

aagccccaat cgctgtgggt agatgcgtcc gaggaagacg gtgcgctgaa ttga 1074 

<210> 154 

<211> 357 

<212> PRT 

<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 154 

Met Pro Asn Ala Arg Lys He Val Gly Ala Val Ala Gin Val Ala Gin 

15 10 15 

Glu Phe Phe Asp Thr Glu Ala Asn Leu Gly Lys Ala He Ala Ala He 

20 25 30 

His Asn Ala Ala Lys Gin Gly Ala Asp He Val Val Phe Ala Glu Cys 
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35 


40 


45 




Tyr 


Leu 


Gly Gin Tyr Pro 


Tyr Trp Ala Gin 


Phe Tyr Asp Asn Ser 


Ala 




50 




55 


60 




Lys 


Asn 


Tyr Ser Lys Val 


Trp Thr Ala Leu 


Tyr Asp Gly Ala He 


Thr 


65 




70 




75 


80 


Val 


Gly 


Gly Asp Glu Cys 


Arg Ala He Ala 


Ala Ala Ala Arg Gin 


Ser 






85 


90 


95 




Lys 


He 


His Val Val Met 


Gly Cys Asn Glu 


Leu Ser Asp Arg Ala 


Gly 






100 


105 


110 . 


Gly 


Ala 


Thr Leu Tyr Asn 


Ser Leu Leu Phe 


Phe Asp Arg Lys Gly 


Glu 






115 


120 


125 




Leu 


He 


Gly Arg His Arg 


Lys Leu Met Pro 


Ser Met His Glu Arg 


Leu 




130 




135 


140 




lie 


His 


Gly Thr Gly Asp 


Gly Arg Asp Leu 


Asn Val Tyr Asp Thr 


Asd 


145 




150 




155 


160 


He 


Gly 


Met Leu Gly Gly 


Leu He Cys Trp 


Glu His His Met Ser 


Leu 






165 


170 


175 




Ser 


Lys 


Tyr Ala Met Ala 


Thr Met Gly Glu 


Glu Val His Val Ala 


Ser 






180 


185 


190 




Trp 


Pro 


Gly Met Trp Arg 


Gly Gly Asp Ala 


Ala He Gly Glu Arg 


Met 






195 


200 


205 




Val 


Glu 


Ala Asp Leu Gly 


Ala Pro Phe Val 


Cys Asp Ala Glu Phe 


Ala 




210 




215 


220 




He 


Arg 


Glu Tyr Ala Ala 


Glu Thr Gly Asn 


Phe Val Leu Ser Ala 


Ser 


225 




230 




235 


240 


Gly 


Tyr 


Phe Pro Lys Asp 


Asn He Ser Asp 


Glu Trp Arg Glu Ala 


He 






245 


250 


255 




Pro 


Asn 


Leu Gin Ala Gin 


Trp Ala Val Gly 


Gly Ser Ser He Val 


Ala 






260 


265 


270 




Pro 


Gly 


Gly Ser Tyr Leu 


Val Pro Pro Leu 


He Asn Glu Glu Lys 


He 






275 


280 


285 




Leu 


Cys 


Ala Glu Leu Asp 


Phe Asn Leu Arg 


Arg Leu Trp Lys Ala 


Trp 




290 




295 


300 


He 


Asp 


Pro He Gly His 


Tyr Ser Arg Pro 


Asp Val Tyr Ser Leu 


Gin 


305 




310 




315 


320 


Leu 


His 


Asn Val Ala Gly 


Arg Glu Tyr Ser 


Tyr Gin Ala Val Asp 


Leu 






325 


330 


335 




Lys 


Arg 


Thr Pro Lys Pro 


Gin Ser Leu Trp 


Val Asp Ala Ser Glu 


Glu 






340 


345 


350 




Asp 


Gly 


Ala Leu Asn 









355 



<210> 155 
<211> 1041 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 155 

atgggcatcg aacatccgaa atacaaggtg gccgtggtgc aggccgcgcc 
gatctcgacg gctcgatcaa gaaggcgatt gcgctgatcg aggaagcggc 
gctaagctga tcgctttccc cgaaaccttc attcccggct atccctggca 
gactcgccgg cctgggcgat cggccgcggc tttgtgcagc gctacttcga 
gcctacgaca gcccgcaagc cgaaaagctg cgcgccgcgg tcaagaaggc 
gccgtgattg gcctgtcgga gcgcgacggc ggcagcctct atatagcgca 
ggccctgatg gcgagaccat cgcaaaacgc agaaagctgc ggccaacgca 
accgtttttg gcgagggtga cggcagcgac cttgccgtgc acgaccggcc 
cggctgggag cgctgtgctg ctgggagcac ctgcaaccgc tttcgaaata 
gcgcagaacg aacaggtcca tgtcgcgtca tggccgagct tctcgctcta 



ggcctggctc 60 

cgccaagggc 120 

catctggctg 180 

taactcgctg 240 

caagctcact 300 

atggctgatt 360 

cgcggaacgc 420 

cggaatcggg 480 

cgcgatgtat 540 

cgaccccttc 600 
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gcgccggcgc tcggcgccga ggtcaacaat gcggcttccc gcgtctacgc ggtcgagggc 660 

tcgtgcttcg tgctggcgcc gtgcgccacg gtttcgcaag ccatgatcga cgagctgtgt 720 

gaccggccgg acaagcatgc gctgttgcac gccggtggcg gacacgccgc gatttacggc 780 

ccggacggca gctcgatcgc ggagaagctg ccgcaggacg cggagggcct gttgatcgcc 840 

gagatcgatc tcggggcgat cggggttgcc aagaatgcag ccgacccggc cggtcattat 900 

tcgcggccgg acgtgacgcg actcctgctg aacaagaacc ggatgcgaag ggtcgaggag 960 

tttgcgctgc cggtcgatcc ggtcgcaacg accgaggagg agcaagtcgc gacgccgtcg 1020 

aggcccagcc aggccgcgta a 1041 

<210> 156 
<211> 346 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 156 



Met 


Gly 


He 


Glu 


His 


Pro 


Lys 


Tyr 


Lys 


Val 


Ala Val Val Gin Ala 


Ala 


1 








5 










10 


15 




Pro 


Ala 


Trp 


Leu 


Asp 


lieu 


Asp 


Gly 


Ser 


He 


Lys Lys Ala He Ala 


Leu 








20 










25 




30 




He 


Glu 


Glu 


Ala 


Ala 


Ala 


Lys 


Gly 


Ala 


Lys 


Leu He Ala Phe Pro 


Glu 






35 










40 






45 




Thr 


Phe 


He 


Pro 


Gly 


Tyr 


Pro 


Trp 


His 


He 


Trp Leu Asp Ser Pro 


Ala 




50 










55 








60 




Trp 


Ala 


He 


Gly 


Arg 


Gly 


Phe 


Val 


Gin 


Arg 


Tyr Phe Asp Asn Ser 


Leu 


65 










70 










75 


80 


Ala 


Tyr 


Asp 


Ser 


Pro 


Gin 


Ala 


Glu 


Lys 


Leu 


Arg Ala Ala Val Lys 


Lys 










85 










90 


95 




Ala 


Lys 


Leu 


Thr 


Ala 


Val 


He 


Gly 


Leu 


Ser 


Glu Arg Asp Gly Gly Ser 








100 










105 




110 




Leu 


Tyr 


He 


Ala 


Gin 


Trp 


Leu 


He 


Gly 


Pro 


Asp Gly Glu Thr He Ala 






115 










120 






125 




Lys 


Arg 


Arg 


Lys 


Leu 


Arg 


Pro 


Thr 


His 


Ala 


Glu Arg Thr Val Phe 


Gly 




130 










135 








140 




Glu 


Gly 


Asp 


Gly 


Ser 


Asp 


Leu 


Ala 


Val 


His 


Asp Arg Pro Gly He 


Gly 


145 










150 










155 


160 


Arg 


Leu 


Gly 


Ala 


Leu 


Cys 


Cys 


Trp 


Glu 


His 


Leu Gin Pro Leu Ser 


Lys 










165 










170 


175 




Tyr 


Ala 


Met 


Tyr 


Ala 


Gin 


Asn 


Glu 


Gin 


Val 


His Val Ala Ser Trp 


Pro 








180 










185 




190 




Ser 


Phe 


Ser 


Leu 


Tyr 


Asp 


Pro 


Phe 


Ala 


Pro 


Ala Leu Gly Ala Glu Val 






195 










200 






205 




Asn 


Asn 


Ala 


Ala 


Ser 


Arg 


Val 


Tyr 


Ala 


Val 


Glu Gly Ser Cys Phe 


Val 




210 










215 








220 




Leu 


Ala 


Pro 


Cys 


Ala 


Thr 


Val 


Ser 


Gin 


Ala 


Met He Asp Glu Leu 


Cys 


225 










230 










235 


240 


Asp 


Arg 


Pro 


Asp 


Lys 


His 


Ala 


Leu 


Leu 


His 


Ala Gly Gly Gly His 


Ala 










245 










250 


255 




Ala 


He 


Tyr 


Gly 


Pro 


Asp 


Gly 


Ser 


Ser 


He 


Ala Glu Lys Leu Pro 


Gin 








260 










265 




270 




Asp 


Ala 


Glu 


Gly 


Leu 


Leu 


He 


Ala 


Glu 


He 


Asp Leu Gly Ala He 


Gly 






275 










280 






285 




Val 


Ala 


Lys 


Asn 


Ala 


Ala 


Asp 


Pro 


Ala 


Gly 


His Tyr Ser Arg Pro 


Asp 




290 










295 








300 




Val 


Thr 


Arg 


Leu 


Leu 


Leu 


Asn 


Lys 


Asn 


Arg 


Met Arg Arg Val Glu 


Glu 


305 










310 










315 


320 


Phe 


Ala 


Leu 


Pro 


Val 


Asp 


Pro 


Val 


Ala 


Thr 


Thr Glu Glu Glu Gin 


Val 










325 










330 


335 




Ala 


Thr 


Pro 


Ser 


Arg 


Pro 


Ser 


Gin 


Ala 


Ala 
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340 345 

<210> 157 
<211> 1011 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 157 

atgagagtcg ttaaagctgc tgcggtccaa ctgagtcccg tgctgtatag ccgtgaggga 60 

acagtagaaa aggtcgttcg gaagatccac gagcttggcg atcaaggagt cgagttcgcc 120 

acgttcccgg agaccgtagt gccctactat ccgtacttct cggccgtcca gacgccgatt 180 

cagaacatgc acggcccgga gcacctgaag ttgctcgagc aatcggtgac cgtcccgtcg 240 

cccgccaccg acgcgatcgg cgacgcctgc cgccacgccg gcgtcgtcgt ctcgatcggc 300 

gtcaacgaac gcgatggcgg cacgatctac aacacgcagc tcctgttcga cgccgacggc 360 

accttgatcc agcgccggcg aaagatcacg ccgaccttct acgaacgaat ggtctgggga 420 

cagggtgacg gttcggggct gcgcgccgtc gacagccgcg taggacgcat cggccagctc 480 

gcctgtttcg agcactacaa cccgctggcg cgctacgcca tgatggccga cggcgagcag 540 

attcactccg cgatgtaccc cggctccatc tttggagacg cattcgcgca gaaaatcgag 600 

atcaacatcc gccagcacgc gctcgagtcc ggtgcgttcg tcgtcaacgc caccgcctgg 660 

ctcgatgccg accagcaggc gcggatcatg aaggataccg gctgcaccat cgaaccgatc 720 

tcgggcggtt gcttcaccgc catcgtcacc ccggacggga ccctgctggg cgaagcgata 780 

cgttcggggg agggagtggt ggtcgccgat ctcgacttca cgctgatcga caggcgcaag 840 

caagtgatgg actctcgtgg tcactacagt cggccggagt tgctcagcct tctgatcgac 900 

cgcacaccca ccgcacacct acacgaacgc gaagcgcacc ccagagcaag tgaggactgg 960 

caaggttccg agagtctgcg cgccatgcag gcctcggcac cgaaggtctg a 1011 

<210> 158 
<211> 336 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 158 



Met 


Arg 


Val 


Val 


Lys 


Ala 


Ala 


Ala 


Val 


Gin 


Leu 


Ser 


Pro 


Val 


Leu 


Tyr 


1 








5 










10 










15 




Ser 


Arg 


Glu 


Gly 
20 


Thr 


Val 


Glu 


Lys 


Val 
25 


Val 


Arg 


Lys 


He 


His 
30 


Glu 


Leu 


Gly 


Asp 


Gin 
35 


Gly 


Val 


Glu 


Phe 


Ala 
40 


Thr 


Phe 


Pro 


Glu 


Thr 
45 


Val 


Val 


Pro 


Tyr 


Tyr 


Pro 


Tyr 


Phe 


Ser 


Ala 


Val 


Gin 


Thr 


Pro 


He 


Gin 


Asn 


Met 


His 




50 








55 










60 










Gly 


Pro 


Glu 


His 


Leu 


Lys 


Leu 


Leu 


Glu 


Gin 


Ser 


Val 


Thr 


Val 


Pro 


Ser 


65 










70 










75 










80 


Pro 


Ala 


Thr 


Asp 


Ala 
85 


He 


Gly 


Asp 


Ala 


Cys 
90 


Arg 


His 


Ala 


Gly 


Val 
95 


Val 


Val 


Ser 


lie 


Gly 
100 


Val 


Asn 


Glu 


Arg 


Asp 
105 


Gly 


Gly 


Thr 


He 


Tyr 
110 


Asn 


Thr 


Gin 


Leu 


Leu 
115 


Phe 


Asp 


Ala 


Asp 


Gly 
120 


Thr 


Leu 


He 


Gin 


Arg 
125 


Arg 


Arg 


Lys 


lie 


Thr 
130 


Pro 


Thr 


Phe 


Tyr 


Glu 
135 


Arg 


Met 


Val 


Trp 


Gly 
140 


Gin 


Gly 


Asp 


Gly 


Ser 


Gly 


Leu 


Arg 


Ala 


Val 


Asp 


Ser 


Arg 


Val 


Gly 


Arg 


He 


Gly 


Gin 


Leu 


145 










150 










155 










160 


Ala 


-Cys 


Phe 


Glu 


His 


Tyr 


Asn 


Pro 


Leu 


Ala 


Arg 


Tyr 


Ala 


Met 


Met 


Ala 








165 










170 










175 




Asp Gly 


Glu 


Gin 


He 


His 


Ser 


Ala 


Met 


Tyr 


Pro 


Gly 


Ser 


He 


Phe 


Gly 
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180 








185 






190 




nap ma 


Phe Ala 


Gin 


Lys He 


Glu 


He 


Asn 


He Arg 


Gin His Ala 


Leu 


195 






200 








205 




Glu Ser 


Gly Ala 


Phe 


Val Val 


Asn 


Ala 


Thr 


Ala Trp 


Leu Asp Ala 


Asp 






215 








220 








nld rvx. y 


He 




Asp 


Thr 


Glv 


Cys Thr 


He Glu Pro 


He 


225 






230 








235 




240 






Phe 


Thr Ala 


He 


Val 


Thr 


Pro Asp 


Gly Thr Leu 


Leu 


245 








250 




255 




Gly Glu 


Ala He 


Arg 


Ser Gly 


Glu 


Gly 


Val 


Val Val 


Ala Asp Leu 


Asp 


260 








265 






270 




Phe Thr 


Leu He 


Asp 


Arg Arg 


Lys 


Gin 


Val 


Met Asp 


Ser Arg Gly 


His 




275 






280 








285 




Tyr Ser 


Arg Pro 


Glu 


Leu Leu 


Ser 


Leu 


Leu 


He Asp 


Arg Thr Pro 


Thr 


290 




295 








300 






Ala His 


Leu His 


Glu 


Arg Glu 


Ala 


His 


Pro 


Arg Ala 


Ser Glu Asp 


Trp 


305 






310 








315 




320 


Gin Gly 


Ser Glu 


Ser 


Leu Arg 


Ala 


Met 


Gin 


Ala Ser 


Ala Pro Lys 


Val 




325 








330 




335 





<210> 159 
<211> 930 
<212> DMA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 159 

atgtcatcaa ccgtgacggt tgecattatt caggcagcac ccgtgtatta tgacctgcct 60 

gccacgctgg acaaagccgc caaactggtg geggatgegg eggcacaggg cgcaacgctg 120 

attgtcttcg gcgagacatg gtttccgggg tatceggcat ggctggatta ctgccccaat 180 

gtcgcgctgt ggaatcatcc cccgaccaag caggtatttg agcgcctgca tcgcaacagc 240 

atcgctgtgc caagcaagga actcgatttt ctgggggcgc tggcacgcaa gcatcaggtg 300 

gtgctggtgt tgagcattaa tgaacgtgtg gagcagggcg eggggcatgg caegctgtat 360 

aacacgctgc tcacgattga cgccgatggc acgctggcaa atcatcatcg caaactgatg 420 

ccgacctata ccgagcgcat ggtgtggggc atgggcgacg gggtggggtt geaageggtg 480 

gatactgecg tegggegegt aggeggctta atctgctggg aacactggat gccgttggca 540 

cgccagacca tgcacatcag eggegaacag attcatattt ccgtcttccc aaccgtccat 600 

gagatgeace agattgecag ccgccagtat gectttgaag ggcggacgtt tgtgctgacc 660 

gttggcggca ttcttgegge acaggacttg cccgccgaac tggaacgccc cgccgatttg 720 

ccgcccacgc agcttgtcca gcgcggcggc agegecatta tcgcgccgga tggtcgttat 780 

ctggcgggtc cagtctataa tgaggaaacc atcctgaccg caacgctgga tttgggcgag 840 

atcatccgcg agagcatgac gctggatgtc aceggacatt atgcccgccc ggatgttttt 900 

gacctgaccg tgaagcgcag ccgaccatga 930 

<210> 160 
<211> 309 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 160 

Met Ser Ser Thr Val Thr Val Ala 

1 5 
Tyr Asp Leu Pro Ala Thr Leu Asp 
20 

Ala Ala Ala Gin Gly Ala Thr Leu 
35 40 



He He Gin Ala Ala Pro Val Tyr 

10 15 
Lys Ala Ala Lys Leu Val Ala Asp 
25 30 
He Val Phe Gly Glu Thr Trp Phe 
45 
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Pro Gly Tyr Pro Ala Trp Leu Asp Tyr Cys Pro Asn Val Ala Leu Trp 

50 55 60 

Asn His Pro Pro Thr Lys Gin Val Phe Glu Arg Leu His Arg Asn Ser 
65 70 75 80 

He Ala Val Pro Ser Lys Glu Leu Asp Phe Leu Gly Ala Leu Ala Arg 

85 90 95 

Lvs His Gin Val Val Leu Val Leu Ser He Asn Glu Arg Val Glu Gin 

100 105 HO 

Glv Ala Gly His Gly Thr Leu Tyr Asn Thr Leu Leu Thr He Asp Ala 

115 120 125 

Asp Gly Thr Leu Ala Asn His His Arg Lys Leu Met Pro Thr Tyr Thr 

130 135 140 

Glu Arg Met Val Trp Gly Met Gly Asp Gly Val Gly Leu Gin Ala Val 
145 150 155 160 

Asp Thr Ala Val Gly Arg Val Gly Gly Leu He Cys Trp Glu His Trp 

165 "0 1 7 5 

Met Pro Leu Ala Arg Gin Thr Met His He Ser Gly Glu Gin He His 

180 185 190 

He Ser Val Phe Pro Thr Val His Glu Met His Gin He Ala Ser Arg 

195 200 205 

Gin Tyr Ala Phe Glu Gly Arg Thr Phe Val Leu Thr Val Gly Gly He 

210 215 220 

Leu Ala Ala Gin Asp Leu Pro Ala Glu Leu Glu Arg Pro Ala Asp Leu 
225 230 235 240 

Pro Pro Thr Gin Leu Val Gin Arg Gly Gly Ser Ala He lie Ala Pro 

245 250 255 

Asp Gly Arg Tyr Leu Ala Gly Pro Val Tyr Asn Glu Glu Thr He Leu 

260 265 270 

Thr Ala Thr Leu Asp Leu Gly Glu He He Arg Glu Ser Met Thr Leu 

275 280 285 

Asp Val Thr Gly His Tyr Ala Arg Pro Asp Val Phe Asp Leu Thr Val 

290 295 300 

Lys Arg Ser Arg Pro 
305 

<210> 161 
<211> 1008 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 

<4Q0> 161 fnrf-ntantc acaccaaacc 60 



atgaccacca tccgcgccgc cgccgtgcag tttagcccgg tgctgtactc gcgccaggcc 
accgtcgaca agctgtgccg caccctgctg gaactgggcc gcgaaggggt gcagttcgcg 120 
gtattcccgg aaaccgtggt gccgtactac ccatattttt ccttcgtgca gccaccgttc 
gccatgggca aacaacacct gttgctgctc gagcaatccg tcactgtgcc ctctgacgtc 
acccggcaga tcggtgaggc ctgccgggaa gcggggatcg tcgccagcat cggcgtcaac 
qaacgcgacg gcggcactat ttataacgcg cagttgctgt tcgatgccga cggcagcctg 
attcaqcagc ggcgcaagat caccccgacc tatcacgaac gcatggtctg ggggcagggc 
gatggttccg gcctgcgcgc cgtggacagt gcggtggggc gtatcggttc cctggcctgc 
tgggaacatt acaaccccct ggcgcgctac gcgctgatgg ccgatggcga acagattcat 
gtggcgatgt ttcccggctc cctggtcggc gacatctttg ccgagcagat cgaagtcacc 
atccgccacc acgccctgga aagcggctgc ttcgtggtca acgccacggc ttggctggat 
gccgaccagc agggccggat catgcaggac accggctgcg agttggggcc gatttccggc 
agctgtttta ccgcgatcat ttccccggag ggcaaggttc tcggcgagcc gctgcgcagc 
ggcgaagggg tggtcattgc tgacctcgac ctggccctga tcgacaagcg caaacgcatg 
aLgattcgg tcggtcacta cagccgcccg gaactgctca gcctgcttat cgaccgcagc 
ccgaccgccc acgtgcatga acttgccgcc gcgcttaatc ctgccaggga gtctgatcca 
ctagtgtcga cctgcaggcg cgcgagctcc agcttttgtt ccctttag 



180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1008 
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<210> 162 
<211> 335 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 162 



Met 


Thr 


Thr 


He Arg 


Ala 


Ala 


Ala 


Val 


Gin 


Phe 


Ser 


Pro 


Val 


Leu 


Tyr 


1 






5 










10 










15 


Ser 


Arg 


Gin 


Ala Thr 
20 


Val 


Asp 


Lys 


Leu 
25 


Cys 


Arg 


Thr 


Leu 


Leu 
30 


Glu 


Leu 


Gly 


Arg 


Glu 
35 


Gly Val 


Gin 


Phe 


Ala 
40 


Val 


Phe 


Pro 


Glu 


Thr 
45 


Val 


Val 


Pro 


Tyr 


Tyr 


Pro 


Tyr Phe 


Ser 


Phe 


Val 


Gin 


Pro 


Pro 


Phe 


Ala 


Met 


Gly 


Lvs 




50 








55 










60 






Gin 


His 


Leu 


Leu Leu 


Leu 


Glu 


Gin 


Ser 


Val 


Thr 


Val 


Pro 


Ser 


Asp 


Val 


65 








70 










75 








80 


Thr 


Arg 


Gin 


He Gly 


Glu 


Ala 


Cys 


Arg 


Glu 


Ala 


Gly 


He 


Val 


Ala 


Ser 








85 










90 








95 




lie 


Gly 


Val 


Asn Glu 
100 


Arg 


Asp 


Gly 


Gly 
105 


Thr 


He 


Tyr 


Asn 


Ala 
110 


Gin 


Leu 


Leu 


Phe 


Asp 


Ala Asp 


Gly 


Ser 


Leu 


He 


Gin 


Gin 


Arg 


Arg 


Lys 


lie 


Thr 






115 








120 










125 






Pro 


Thr 


Tyr 


His Glu 


Arg 


Met 


Val 


Trp 


Gly 


Gin 


Gly 


Asd 


Glv 


Ser 


Glv 




130 








135 










140 








Leu 


Arg 


Ala 


Val Asp 


Ser 


Ala 


Val 


Gly 


Arg 


He 


Gly 


Ser 


Leu 


Ala 


Cvs 


145 








150 










155 










160 


Trp 


Glu 


His 


Tyr Asn 


Pro 


Leu 


Ala 


Arg 


Tyr 


Ala 


Leu 


Met 


Ala 


Asp 


Gly 








165 










170 










175 


Glu 


Gin 


He 


His Val 


Ala 


Met 


Phe 


Pro 


Gly 


Ser 


Leu 


Val 


Gly 


Asp 


He 








180 








185 










190 




Phe 


Ala 


Glu 
195 


Gin He 


Glu 


Val 


Thr 
200 


He 


Arg 


His 


His 


Ala 
205 


Leu 


Glu 


Ser 


Gly 


Cys 


Phe 


Val Val 


Asn 


Ala 


Thr 


Ala 


Trp 


Leu 


Asp 


Ala 


Asp 


Gin 


Gin 




210 








215 










220 








Gly 


Arg 


He 


Met Gin 


Asp 


Thr 


Gly 


Cys 


Glu 


Leu 


Gly 


Pro 


He 


Ser 


Gly 


225 








230 










235 










240 


Gly 


Cys 


Phe 


Thr Ala 
245 


He 


He 


Ser 


Pro 


Glu 
250 


Gly 


Lys 


Val 


Leu 


Gly 
255 


Glu 


Pro 


Leu 


Arg 


Ser Gly 
260 


Glu 


Gly 


Val 


Val 
265 


He 


Ala 


Asp 


Leu 


Asp 
270 


Leu 


Ala 


Leu 


lie 


Asp 


Lys Arg 


Lys 


Arg 


Met 


Met 


Asp 


Ser 


Val 


Gly 


His 


Tyr 


Ser 






275 








280 










285 






Arg 


Pro 
290 


Glu 


Leu Leu 


Ser 


Leu 
295 


Leu 


He 


Asp 


Arg 


Ser 
300 


Pro 


Thr 


Ala 


His 


Val 


His 


Glu 


Leu Ala 


Ala 


Ala 


Leu 


Asn 


Pro 


Ala 


Arg 


Glu 


Ser 


Asp 


Pro 


305 








310 










315 








320 


Leu 


Val 


Ser 


Thr Cys 


Arg 


Arg 


Ala 


Ser 


Ser 


Ser 


Phe 


Cys 


Ser 


Leu 










325 










330 








335 





<210> 163 
<211> 978 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 
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<400> 163 

gtgaccatca tcaaagccgc cgcagtgcag atcagccccg tgctttacag ccgggaagcc 60 

accgtcgaaa aggtcgttcg cgagacccgc gaactcggcc agaagggcgt gcagttcgca 120 

acgtttccgg aaaccgtggt gccgtactac ccatacttct ccgccgtcca gacgggcatc 180 

gaactgctgt ccggcaaaga gcacctgcga ctgctggagc aggccgtgac tgttccttcc 240 

cccgccactg atgcgattgc ccaggcggca cgcgaggccg gcatggtggt gtcgatcggc 300 

gtcaacgagc gtgacggcgg caccatctac aacacgcagc tgctctttga tgccgacggc 360 

acgctggtgc agcgccgccg caagatcacg ccgacgcatt tcgagcgcat ggtgtggggc 420 

cagggcgacg gttcgggcct gcgcgcagtg gataccaagg tcggccgcat tggccagctg 480 

gcctgcttcg agcacaacaa cccgctcgcg cgctacgcaa tgatggccga tggcgagcag 540 

atccattcct ccatgtaccc gggctccgcc ttcggcgacg gattcgcgca gcgcatggag 600 

atcaacattc gccaacacgc cctggagtcg ggttgcttcg tggtgaatgc caccgcgtgg 660 

ctcgacgccg accagcaggc gcagatcatg aaggacacgg gctgcgccat cgggccgatc 720 

tctggcggct gcttcacgac catcgtcacg ccggacggca tgctgatcgg cgaacccctc 780 

cgcgagggcg agggcgagat catcgccgac ctcgatttca ccctgatcga ccgccgcaag 840 

ctgctgatgg actcggtcgg ccactacaac cgtccggagc tgctgagcct gctgatcgac 900 

cgcacacccg cggcgaactt ccatgagcgc agtacgcatc cggccgtcga tgccgccagc 960 

ggcctcgaaa tcctctaa 978 

<210> 164 
<211> 325 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 164 



Val 


Thr 


lie 


He 


Lys 


Ala 


Ala 


Ala 


Val 


Gin 


He 


Ser 


Pro 


Val 


Leu 


Tyr 


1 








5 










10 










15 




Ser 


Arg 


Glu 


Ala 


Thr 


Val 


Glu 


Lys 


Val 


Val 


Arg 


Glu 


Thr 


Arg 


Glu 


Leu 








20 










25 










30 






Gly 


Gin 


Lys 


Gly 


Val 


Gin 


Phe 


Ala 


Thr 


Phe 


Pro 


Glu 


Thr 


Val 


Val 


Pro 






35 










40 










45 








Tyr 


Tyr 


Pro 


Tyr 


Phe 


Ser 


Ala 


Val 


Gin 


Thr 


Gly 


He 


Glu 


Leu 


Leu 


Ser 




50 










55 










60 










Gly 


Lys 


Glu 


His 


Leu 


Arg 


Leu 


Leu 


Glu 


Gin 


Ala 


Val 


Thr 


Val 


Pro 


Ser 


65 










70 










75 










80 


Pro 


Ala 


Thr 


Asp 


Ala 


He 


Ala 


Gin 


Ala 


Ala 


Arg 


Glu 


Ala 


Gly 


Met 


Val 










85 










90 










95 




Val 


Ser 


He 


Gly 


Val 


Asn 


Glu 


Arg 


Asp 


Gly 


Gly 


Thr 


He 


Tyr 


Asn 


Thr 








100 










105 










110 






Gin 


Leu 


Leu 


Phe 


Asp 


Ala 


Asp 


Gly 


Thr 


Leu 


Val 


Gin 


Arg 


Arg 


Arg 


Lys 






115 










120 










125 








lie 


Thr 


Pro 


Thr 


His 


Phe 


Glu 


Arg 


Met 


Val 


Trp 


Gly 


Gin 


Gly 


Asp 


Gly 




130 










135 










140 










Ser 


Gly 


Leu 


Arg 


Ala 


Val 


Asp 


Thr 


Lys 


Val 


Gly 


Arg 


He 


Gly 


Gin 


Leu 


145 










150 










155 










160 


Ala 


Cys 


Phe 


Glu 


His 


Asn 


Asn 


Pro 


Leu 


Ala 


Arg 


Tyr 


Ala 


Met 


Met 


Ala 










165 










170 










175 




Asp 


Gly 


Glu 


Gin 


He 


His 


Ser 


Ser 


Met 


Tyr 


Pro 


Gly 


Ser 


Ala 


Phe 


Gly 








180 










185 










190 






Asp 


Gly 


Phe 


Ala 


Gin 


Arg 


Met 


Glu 


He 


Asn 


He 


Arg 


Gin 


His 


Ala 


Leu 






195 










200 










205 








Glu 


Ser 


Gly 


Cys 


Phe 


Val 


Val 


Asn 


Ala 


Thr 


Ala 


Trp 


Leu 


Asp 


Ala 


Asp 




210 










215 










220 










Gin 


Gin 


Ala 


Gin 


He 


Met 


Lys 


Asp 


Thr 


Gly 


Cys 


Ala 


He 


Gly 


Pro 


He 


225 










230 










235 










240 


Ser 


Gly 


Gly 


Cys 


Phe 


Thr 


Thr 


He 


Val 


Thr 


Pro 


Asp 


Gly 


Met 


Leu 


He 










245 










250 










255 




Gly 


Glu 


Pro 


Leu 


Arg 


Glu 


Gly 


Glu 


Gly 


Glu 


He 


He 


Ala 


Asp 


Leu 


Asp 
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260 265 270 

Phe Thr Leu lie Asp Arg Arg Lys Leu Leu Met Asp Ser Val Gly His 

275 280 285 

Tyr Asn Arg Pro Glu Leu Leu Ser Leu Leu lie Asp Arg Thr Pro Ala 

290 295 300 

Ala Asn Phe His Glu Arg Ser Thr His Pro Ala Val Asp Ala Ala Ser 
305 310 315 320 

Gly Leu Glu lie Leu 
325 

<210> 165 
<211> 1008 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 165 

atggccaatt tcaaattcaa ggcggcggcg gtgcaggccg cgcccgcttt cctcgatctc 60 

gaggctagca tcgccaagtc gatcgccctg atcgaacaag ccgccgccaa cggcgccaag 120 

ctgatcgcct ttcccgaagt cttcattccc ggctacccct ggcacatctg gctcgacagt 180 

cccgcctggg cgatcgggcg cggcttcgtc tcgcgctatt tcgagaactc gctggactac 240 

aacagccccg aggccgagcg cctcaggctc gccgtcaaga aggcgggcct gacggcggtg 300 

atcggcctct ccgagcgcga cggcggcagc ctctacatcg cgcaatggat catcggccct 360 

gacggcgaga ccgttgcgaa acggcgtaag ctccggccga cccattgcga gcgcacggtc 420 

tatggagaag gcgacggcag cgacctcgcg gttcacgacg tatctggcat cggccgtctc 480 

ggcgcgctct gctgctggga gcatatccag ccgctgtcga aattcgcgat gtattcgcaa 540 

aatgagcaag tgcacgtcgc gtcctggccg agcttctcgc tctacgaccc gttcgcgccg 600 

gcgctgggcg ccgaggtcaa caacgcagcc tcgcggatct atgcggtcga aggctcatgc 660 

ttcgtcattg cgccctgcgc gaccgtttcg cctgcaatga tcgaggaact gtgcgacgcg 720 

ccaaacaaac atgcgcttct gcacgcgggc ggcggcttcg cgcgcatcta tgggccggac 780 

ggcgcttcga tcgccgagac gctgccgcca gatcaggaag gcttgatcta cgccgacatc 840 

gacctcaccg cgatcggcgt cgccaaggcc gccgccgatc ccgccggcca ttattcgcgc 900 

cccgacgtca cgcgcctgct cttcaacaag aagcccgctc ggcgagtcga aacttttgct 960 

ttgcccgtcg atgcgccggc gccggagacg cagaccgccg cgagctga 1008 

<210> 166 
<211> 335 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 166 

Met Ala Asn Phe Lys Phe Lys Ala Ala Ala Val Gin Ala Ala Pro Ala 

15 10 15 

Phe Leu Asp Leu Glu Ala Ser lie Ala Lys Ser lie Ala Leu lie Glu 

20 25 30 

Gin Ala Ala Ala Asn Gly Ala Lys Leu He Ala Phe Pro Glu Val Phe 

35 40 45 

He Pro Gly Tyr Pro Trp His He Trp Leu Asp Ser Pro Ala Trp Ala 

50 55 60 

He Gly Arg Gly Phe Val Ser Arg Tyr Phe Glu Asn Ser Leu Asp Tyr 
65 70 75 80 

Asn Ser Pro Glu Ala Glu Arg Leu Arg Leu Ala Val Lys Lys Ala Gly 

85 90 95 

Leu Thr Ala Val He Gly Leu Ser Glu Arg Asp Gly Gly Ser Leu Tyr 

100 105 110 

He Ala Gin Trp He He Gly Pro Asp Gly Glu Thr Val Ala Lys Arg 
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115 








120 


125 




Arg 


Lys 


Leu 


Arg 


Pro 


Thr His 


Cys Glu Arg Thr Val 


Tyr Gly Glu 


Gly 




130 








135 


140 






Asp Gly 


Ser 


Asp 


Leu 


Ala Val 


His Asp Val Ser Gly 


He Gly Arg 


Leu 


145 










150 


155 




160 


Gly Ala 


Leu 


Cvs 


Cys 


Trp Glu 


His He Gin Pro Leu 


Ser Lys Phe 


Ala 










165 




170 


175 




Met 


Tyr 


Ser 


Gin 


Asn 


Glu Gin 


Val His Val Ala Ser 


Trp Pro Ser 


Phe 








180 






185 


190 




Ser 


Leu 


Tvr 


Asd 


Pro 


Phe Ala 


Pro Ala Leu Gly Ala 


Glu Val Asn 


Asn 






195 








200 


205 




Ala 


Ala 


Ser 


Ara 


He Tyr Ala 


Val Glu Gly Ser Cys 


Phe Val He 


Ala 




210 








215 


220 






Pro 


Cys 


Ala 


Thr 


Val 


Ser Pro 


Ala Met He Glu Glu 


Leu Cys Asp 


Ala 


225 










230 


235 




240 


Pro 


Asn 


Lys 


His 


Ala 


Leu Leu 


His Ala Gly Gly Gly 


Phe Ala Arg 


He 










245 




250 


255 




Tyr 


Gly 


Pro 


Asp 


Gly Ala Ser 


He Ala Glu Thr Leu 


Pro Pro Asp 


Gin 








260 






265 


270 




Glu Gly 


Leu 


He 


Tyr Ala Asp 


He Asp Leu Thr Ala 


He Gly Val 


Ala 






275 








280 


285 




Lys 


Ala 


Ala 


Ala 


Asp 


Pro Ala 


Gly His Tyr Ser Arg 


Pro Asp Val 


Thr 




290 








295 


300 






Arg 


Leu 


Leu 


Phe 


Asn 


Lys Lys 


Pro Ala Arg Arg Val 


Glu Thr Phe 


Ala 


305 










310 


315 




320 


Leu 


Pro 


Val 


Asp 


Ala 


Pro Ala 


Pro Glu Thr Gin Thr 


Ala Ala Ser 












325 




330 


335 





<210> 167 
<211> 1017 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 167 

atgggtattg aacatccgaa gtacagggtt gccgtggtgc aggccgcacc ggcctggctc 60 

gatcttgacg cgtcgatcga caagtcgatc gcgctgatcg aggaggctgc ccagaaaggc 120 

gccaagctga tcgcattccc cgaggccttc atccccggct acccctggca tatctggatg 180 

gactcgcccg cctgggcgat tggccgcggt tttgtgcagc gctacttcga caattcgctg 240 

gcctatgaca gcccgcaggc cgagaagctg cgcgcggccg tgcgcaaggc aaaactcacg 300 

gccgtgatcg gcttgtcgga gcgtgacggc ggcagccttt atctcgcaca atggctgatc 360 

ggccccgacg gcgagaccat cgcaaaacgg cgcaagctgc ggccgacaca tgccgagcgc 420 

actgtgtacg gcgagggcga cggcagcgac cttgcggtcc acaatcgtcc ggacatcggc 480 

aggctcggtg cgctctgctg ctgggagcat cttcagccac tgtcgaaata cgcgatgtac 540 

gcgcagaacg agcaggtgca cgtcgcggcc tggccgagct tttcgctcta cgatcccttc 600 

gccgtggcgc tcggcgccga ggtgaacaac gcggcctccc gcgtctatgc ggtcgaaggc 660 

tcctgcttcg tgctggcgcc gtgcgcgaca gtctcgcaag ccatgatcga cgagctctgc 720 

gatcggccgg acaagcacgc gctgctgcat gtcggcggcg gctttgccgc gatctacggg 780 

cccgacggca gccagatcgg cgacaagctc gcccccgacc aggagggcct gttgatcgcc 840 

gagatcgatc tcggcgccat aggtgtcgcc aagaacgccg cggatcccgc cgggcactat 900 

tcgcggcccg acgtgacgcg gctgttgctc aacaagaaac cgtacaagcg cgtcgaacag 960 

ttctcgccgc cgtcggaggc ggttgaaccc acggatatcg cggcggcggc aagctga 1017 

<210> 168 

<211> 338 

<212> PRT 

<213> Unknown 

<220> 
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<223> Obtained from an environmental sample 
<400> 168 

Met Gly He Glu His Pro Lys Tyr Arg Val Ala Val Val Gin Ala Ala 

15 10 15 

Pro Ala Trp Leu Asp Leu Asp Ala Ser He Asp Lys Ser lie Ala Leu 

20 25 30 

He Glu Glu Ala Ala Gin Lys Gly Ala Lys Leu He Ala Phe Pro Glu 

35 40 45 

Ala Phe He Pro Gly Tyr Pro Trp His He Trp Met Asp Ser Pro Ala 

50 55 60 

Trp Ala He Gly Arg Gly Phe Val Gin Arg Tyr Phe Asp Asn Ser Leu 
65 70 75 80 

Ala Tyr Asp Ser Pro Gin Ala Glu Lys Leu Arg Ala Ala Val Arg Lys 

85 90 95 

Ala Lys Leu Thr Ala Val He Gly Leu Ser Glu Arg Asp Gly Gly Ser 

100 105 110 

Leu Tyr Leu Ala Gin Trp Leu He Gly Pro Asp Gly Glu Thr He Ala 

115 120 125 

Lys Arg Arg Lys Leu Arg Pro Thr His Ala Glu Arg Thr Val Tyr Gly 

130 135 140 

Glu Gly Asp Gly Ser Asp Leu Ala Val His Asn Arg Pro Asp He Gly 
145 150 155 160 

Arg Leu Gly Ala Leu Cys Cys Trp Glu His Leu Gin Pro Leu Ser Lys 

165 170 175 

Tyr Ala Met Tyr Ala Gin Asn Glu Gin Val His Val Ala Ala Trp Pro 

180 185 190 

Ser Phe Ser Leu Tyr Asp Pro Phe Ala Val Ala Leu Gly Ala Glu Val 

195 200 205 

Asn Asn Ala Ala Ser Arg Val Tyr Ala Val Glu Gly Ser Cys Phe Val 

210 215 220 

Leu Ala Pro Cys Ala Thr Val Ser Gin Ala Met He Asp Glu Leu Cys 
225 230 235 240 

Asp Arg Pro Asp Lys His Ala Leu Leu His Val Gly Gly Gly Phe Ala 

245 250 255 

Ala He Tyr Gly Pro Asp Gly Ser Gin He Gly Asp Lys Leu Ala Pro 

260 265 270 

Asp Gin Glu Gly Leu Leu He Ala Glu He Asp Leu Gly Ala He Gly 

275 280 285 

Val Ala Lys Asn Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp 

290 295 300 

Val Thr Arg Leu Leu Leu Asn Lys Lys Pro Tyr Lys Arg Val Glu Gin 
305 310 315 320 

Phe Ser Pro Pro Ser Glu Ala Val Glu Pro Thr Asp He Ala Ala Ala 
325 330 335 

Ala Ser 



<210> 169 
<211> 1077 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 169 

atggccctca cccatccgaa attgaaagtc gccgccgtgc aggcagctcc cgcgttcctc 60 
gatgtcgatg ccgcagtgga caaagcggtg cggctaatcg acgaagcggc agcaaacggc 120 
tccagtctgg tggcattccc cgagacctgg atccccggct atccgttttg gatctggctt 180 
ggctcgccgg. cctgggcaat catgcgcggg tttgtgtctc gctatttcga taattcgctc 240 
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agctatgaca gccggcaggc agagcgcctg cgcgacgccg cgaagcgcca caaactgacc 300 

gtcgtcatgg gcctgtccga gcgcgccggc ggtagccttt acatcgcgca gtggatcatt 360 

ggtcccaatg gcgagaccgt cgcacagcgg cgcaagctca agcccaccca tgcggagcgc 420 

accgtcttcg gcgagggtga cggcagccac ctggcggtac acaatcttcc aatcggacgg 480 

ctcggtgcgc tgtgctgctg ggagcacctc cagccgctct ccaaatacgc gatgtacgcc 540 

cagaacgaag agatccacgt ggcggcatgg ccgtccttct cgctctacga cccgtttgcg 600 

cacgcgctcg gcgccgaagt caacaacgca gcgagccaga tctacgcggt tgaaggttcc 660 

tgctttgtcg tcgcgccatg tgcggtgatc tcgcaggaaa tgatcgatct tatgtgcgat 720 

acccccgaca agcatcagct tattcacgtc ggtggcggct tcaccgtgat ctatggcccg 780 

gacggtgcgc gcatcggcga caagctcgcg ccagatcagg aaggcattgt ctatgccgac 840 

atcgatctcg gcatgatccc gatcgcgaaa gctgccgccg atcctgccgg ccactatgcg 900 

cgacccgacg ttacccgcct tctgttcaac aatcgtcccg ccaatcgggt ggaaaccctc 960 

gtgctccccg ttgatcaggt ccgtgacatc gatgcacgtg tggaggccgc ggcacctcag 1020 

gcgcgaccag caaccgggaa cgaggatccc gccgcaaagc ctatggccgc cgaatga 1077 

<210> 170 
<211> 358 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 170 



Met Ala 


Leu 


Thr 


His 


Pro 


Lys 


Leu 


Lys 


Val 


Ala 


Ala 


Val 


Gin 


Ala 


Ala 


1 






5 










10 










15 




Pro Ala 


Phe 


Leu 


Asp 


Val 


Asp 


Ala 


Ala 


Val 


Asp 


Lys 


Ala 


Val 


Arg 


Leu 






20 










25 










30 






He Asp 


Glu 


Ala 


Ala 


Ala 


Asn 


Gly 


Ser 


Ser 


Leu 


Val 


Ala 


Phe 


Pro 


Glu 




35 










40 










45 








Thr Trp 


He 


Pro 


Gly 


Tyr 


Pro 


Phe 


Trp 


He 


Trp 


Leu 


Gly 


Ser 


Pro 


Ala 


50 










55 










60 










Trp Ala 


He 


Met 


Arg 


Gly 


Phe 


Val 


Ser 


Arg 


Tyr 


Phe 


Asp 


Asn 


Ser 


Leu 


65 








70 










75 










80 


Ser Tyr 


Asp 


Ser 


Arg 


Gin 


Ala 


Glu 


Arg 


Leu 


Arg 


Asp 


Ala 


Ala 


Lys 


Arg 








85 










90 










95 




His Lys 


Leu 


Thr 


Val 


Val 


Met 


Gly 


Leu 


Ser 


Glu 


Arg 


Ala 


Gly 


Gly 


Ser 






100 










105 










110 






Leu Tyr 


He 


Ala 


Gin 


Trp 


He 


He 


Gly 


Pro 


Asn 


Gly 


Glu 


Thr 


Val 


Ala 




115 










120 










125 








Gin Arg 


Arg 


Lys 


Leu 


Lys 


Pro 


Thr 


His 


Ala 


Glu 


Arg 


Thr 


Val 


Phe 


Gly 


130 










135 










140 










Glu Gly 


Asp 


Gly 


Ser 


His 


Leu 


Ala 


Val 


His 


Asn 


Leu 


Pro 


He 


Gly 


Arg 


145 








150 










155 










160 


Leu Gly 


Ala 


Leu 


Cys 


Cys 


Trp 


Glu 


His 


Leu 


Gin 


Pro 


Leu 


Ser 


Lys 


Tyr 








165 










170 










175 




Ala Met 


Tyr 


Ala 


Gin 


Asn 


Glu 


Glu 


He 


His 


Val 


Ala 


Ala 


Trp 


Pro 


Ser 






180 










185 










190 






Phe Ser 


Leu 


Tyr 


Asp 


Pro 


Phe 


Ala 


His 


Ala 


Leu 


Gly 


Ala 


Glu 


Val 


Asn 




195 










200 










205 








Asn Ala 


Ala 


Ser 


Gin 


He 


Tyr 


Ala 


Val 


Glu 


Gly 


Ser 


Cys 


Phe 


Val 


Val 


210 










215 










220 










Ala Pro 


Cys 


Ala 


Val 


He 


Ser 


Gin 


Glu 


Met 


He 


Asp 


Leu 


Met 


Cys 


Asp 


225 






230 










235 










240 


Thr Pro 


Asp 


Lys 


His 


Gin 


Leu 


He 


His 


Val 


Gly 


Gly 


Gly 


Phe 


Thr 


Val 








245 










250 










255 




He Tyr 


Gly 


Pro 


Asp 


Gly 


Ala 


Arg 


He 


Gly 


Asp 


Lys 


Leu 


Ala 


Pro 


Asp 






260 










265 










270 






Gin Glu 


Gly 


He 


Val 


Tyr 


Ala 


Asp 


He 


Asp 


Leu 


Gly 


Met 


He 


Pro 


He 




275 










280 










285 








Ala Lys 


Ala 


Ala 


Ala 


Asp 


Pro 


Ala 


Gly 


His 


Tyr 


Ala 


Arg 


Pro 


Asp 


Val 
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290 






295 300 






Thr 


Arg 


Leu 


Leu 


Phe Asn Asn Arg Pro Ala Asn Arg Val Glu 


Thr 


Leu 


305 








310 315 




320 


Val 


Leu 


Pro 


Val 


Asp Gin Val Arg Asp lie Asp Ala Arg Val 
325 330 


Glu 
335 


Ala 


Ala 


Ala 


Pro 


Gin 
340 


Ala Arg Pro Ala Thr Gly Asn Glu Asp Pro 
345 350 


Ala 


Ala 


Lys 


Pro 


Met 
355 


Ala 


Ala Glu 







<210> 171 
<211> 1011 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 171 

atgactaagg gaacagtgaa agtcgcggcg gcgcaggtca cccccgtgtt catggatcgt 60 

aaagccacga tcgtcaaggc ctgcgacacg atcgccgagg cgggcaagaa cggcgcgcgg 120 

ctggtggtgt tcccggagac gtttgttcct ggctacccgg actgggtctg gacggcgacg 180 

gctgggacgc atcgcgatat ccaccaggcg atgtacgcgg aactgctgga ccaggctgtc 240 

tcgattccga gcccggcgac ggacgccctc tgccgtgctg caaagaaggc gggcgtctac 300 

gtcgtcatcg gcgtcaatga gctgagtggg ccgggcggaa gcctgtacaa cacgctgatc 360 

tacatcgatg acgaaggcga gatcatgggc cgccaccgca agctggtccc cacgatgggc 420 

gagcgcctgg tctgggcacc cggcgacggc agcacgctgg aggcgtacga gacatcgatc 480 

ggcaggctgg gcggactgat ctgctgggag aactacatgc cgctggcccg ctacgccatg 540 

tacgcctggg gcgtgcagat ctacgtcgcg ccgacgtggg acagctcgga cgggtgggtt 600 

ggcagcatgc agcacatcgc ccgcgaaggg cggacggcgg tgatcggctg ctgcatggcg 660 

atccgtcgca gcgacatccc ggacaagtac gagttcaaga agctgtaccc gccgagcaag 720 

agcaaagacg aagaatgggt gaacgatggc aacagcgtca tcgtcgcacc cggtggacga 780 

atactcgccg ggccggtcgc caaagaggag acgatcctct acgccgatct ggacccggca 840 

gccgagcgcg gttcaaagtt ctcgttagat gtggcagggc actacgcgcg gccggacgtc 900 

ttccagctga cggtgaatcg cggtccggca gaactggtga atgtggccgg tgatatcgca 960 

ccggcaacca acggcaaagt caaaacaccg gcgaaattac gccgcaagta a 1011 

<210> 172 
<211> 336 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 172 



Met 


Thr 


Lys 


Gly 


Thr 


Val 


Lys 


Val 


Ala 


Ala 


Ala 


Gin 


Val 


Thr 


Pro 


Val 


1 




5 










10 










15 




Phe 


Met 


Asp 


Arg 


Lys 


Ala 


Thr 


He 


Val 


Lys 


Ala Cys 


Asp 


Thr 


He 


Ala 








20 










25 










30 






Glu 


Ala 


Gly 
35 


Lys 


Asn 


Gly 


Ala 


Arg 
40 


Leu 


Val 


Val 


Phe 


Pro 
45 


Glu 


Thr 


Phe 


Val 


Pro 
50 


Gly 


Tyr 


Pro 


Asp 


Trp 
55 


Val 


Trp 


Thr 


Ala 


Thr 
60 


Ala 


Gly 


Thr 


His 


Arg 


Asp 


lie 


His 


Gin 


Ala 


Met 


Tyr 


Ala 


Glu 


Leu 


Leu 


Asp 


Gin 


Ala 


Val 


65 










70 










75 










80 


Ser 


lie 


Pro 


Ser 


Pro 
85 


Ala 


Thr 


Asp 


Ala 


Leu 
90 


Cys 


Arg 


Ala 


Ala 


Lys 
95 


Lys 


Ala 


Gly 


Val 


Tyr 
100 


Val 


Val 


He 


Gly 


Val 
105 


Asn 


Glu 


Leu 


Ser 


Gly 
110 


Pro 


Gly 


Gly 


Ser 


Leu 


Tyr 


Asn 


Thr 


Leu 


He 


Tyr 


He 


Asp Asp 


Glu 


Gly 


Glu 


He 
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115 








120 








125 




Met 


Gly 


Arg 


His Arg 


Lys 


Leu 


Val 


Pro 


Thr 


Met Gly 


Glu Arg Leu 


Val 




130 








135 








140 






Trp 


Ala 


Pro 


Gly Asp 


Gly 


Ser 


Thr 


Leu 


Glu 


Ala Tyr 


Glu Thr Ser 


He 


145 








150 










155 




160 


Gly 


Arg 


Leu 


Gly Gly 


Leu 


He 


Cys 


Trp 


Glu 


Asn Tyr 


Met Pro Leu 


Ala 








165 










170 




175 




Arg 


Tyr 


Ala 


Met Tyr 


Ala 


Trp 


Gly 


Val 


Gin 


He Tyr 


Val Ala Pro 


Thr 








180 








185 






190 




Trp 


Asp 


Ser 


Ser Asp 


Gly 


Trp 


Val 


Gly 


Ser 


Met Gin 


His He Ala 


Arg 






195 








200 








205 


Glu 


Gly 


Arg 


Thr Ala 


Val 


He 


Gly 


Cys 


Cys 


Met Ala 


He Arg Arg 


Ser 




210 








215 








220 






Asp 


He 


Pro 


Asp Lys 


Tyr 


Glu 


Phe 


Lys 


Lys 


Leu Tyr 


Pro Pro Ser 


Lys 


225 








230 










235 




240 


Ser 


Lys 


Asp 


Glu Glu 


Trp 


Val 


Asn 


Asp 


Gly 


Asn Ser 


Val He Val 


Ala 








245 










250 




255 




Pro 


Gly 


Gly 


Arg He 


Leu 


Ala 


Gly 


Pro 


Val 


Ala Lys 


Glu Glu Thr 


He 








260 








265 






270 




Leu 


Tyr 


Ala 


Asp Leu 


Asp 


Pro 


Ala 


Ala 


Glu 


Arg Gly 


Ser Lys Phe 


Ser 






275 








280 








285 




Leu 


Asp 


Val 


Ala Gly 


His 


Tyr 


Ala 


Arg 


Pro 


Asp Val 


Phe Gin Leu 


Thr 




290 








295 








300 






Val 


Asn 


Arg 


Gly Pro 


Ala 


Glu 


Leu 


Val 


Asn 


Val Ala 


Gly Asp He 


Ala 


305 








310 










315 


320 


Pro 


Ala 


Thr 


Asn Gly 


Lys 


Val 


Lys 


Thr 


Pro 


Ala Lys 


Leu Arg Arg 


Lys 








325 










330 




335 





<210> 173 
<211> 993 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 173 

atgaaagttg 

acggtcgaga 

accttcccgg 

cagattatag 

cctgccaccg 

gtcaacgagc 

gccttgatcc 

cagggcgacg 

gcatgctggg 

atccattcgg 

atcaatatcc 

ctggacgccg 

tccggcggtt 

cgctcgggcg 

caggtgatgg 

cgcacaccca 

cgttcggagg 



ttaaagccgc 
gggtcgttcg 
agaccgtagt 
ccggacctga 
acgctatcag 
gtgacggcgg 
agcgccgccg 
ggtcgggcct 
agcacaacaa 
caatgtatcc 
ggcagcatgc 
atcagcaggc 
gcttcaccgc 
aaggcatggt 
actcgcgcgg 
ctgcgcatgt 
aacttctcgc 



cgcagtccaa 
gaagatccac 
gccttactac 
gcatctaaag 
cgaggctgcc 
aaccctgtac 
caagattacg 
gcgcgctgtc 
ccccctggcg 
gggctccatg 
attggagtct 
gcaaatctgc 
gatcgtggcg 
catcgtcgac 
ccactacaac 
tcacgaccgc 
tacggctgtc 



ctgagtcccg 
gagcttggcc 
ccgtactttt 
ctgctcgacc 
aggcaggcgg 
aacacgcagc 
cccactcatt 
gacagcaagg 
cgctacgcga 
ttcggcgacc 
gcgtgcttcg 
aaggacactg 
cctgatggaa 
ctcgacttca 
cggccggaat 
gctgtgcgcc 
taa 



tcctctatag 
ggcagggggt 
ccttcgtcca 
aggcagtgac 
gagttgtggt 
tgctcttcga 
tcgagcgcat 
tcggtcgcat 
tgatagccga 
cgtttgccca 
tcgtgtgcgc 
gctgcgacat 
ccttgctggg 
cgctcatcga 
tgctcagtct 
ccgagtcagc 



ccgcgaggga 
acagttcgcc 
gacgccctta 
cgtgccgtcc 
gtccataggc 
tgccgatggc 
gatctggggc 
tggccagctc 
cggcgagcag 
gaagacggaa 
cacggcctgg 
cggcccgatc 
cgagcccatc 
caagcgcaag 
cctgatcgac 
cgcggagcaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
993 



<210> 174 
<211> 330 
<212> PRT 
<213> Unknown 



<220> 
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<223> Obtained from an environmental sample 
<400> 174 

Met Lys Val Val Lys Ala Ala Ala Val Gin Leu Ser Pro Val Leu Tyr 

15 10 15 

Ser Arg Glu Gly Thr Val Glu Arg Val Val Arg Lys lie His Glu Leu 

20 25 30 

Gly Arg Gin Gly Val Gin Phe Ala Thr Phe Pro Glu Thr Val Val Pro 

35 40 45 

Tyr Tyr Pro Tyr Phe Ser Phe Val Gin Thr Pro Leu Gin lie lie Ala 

50 55 60 

Gly Pro Glu His Leu Lys Leu Leu Asp Gin Ala Val Thr Val Pro Ser 
65 70 75 80 

Pro Ala Thr Asp Ala lie Ser Glu Ala Ala Arg Gin Ala Gly Val Val 

85 90 95 

Val Ser lie Gly Val Asn Glu Arg Asp Gly Gly Thr Leu Tyr Asn Thr 

100 105 110 

Gin Leu Leu Phe Asp Ala Asp Gly Ala Leu lie Gin Arg Arg Arg Lys 

115 120 125 

lie Thr Pro Thr His Phe Glu Arg Met He Trp Gly Gin Gly Asp Gly 

130 135 -140 

Ser Gly Leu Arg Ala Val Asp Ser Lys Val Gly Arg He Gly Gin Leu 
145 150 155 160 

Ala Cys Trp Glu His Asn Asn Pro Leu Ala Arg Tyr Ala Met He Ala 

165 170 175 

Asp Gly Glu Gin He His Ser Ala Met Tyr Pro Gly Ser Met Phe Gly 

180 185 190 

Asp Pro Phe Ala Gin Lys Thr Glu He Asn He Arg Gin His Ala Leu 

195 200 205 

Glu Ser Ala Cys Phe Val Val Cys Ala Thr Ala Trp Leu Asp Ala Asp 

210 215 220 

Gin Gin Ala Gin He Cys Lys Asp Thr Gly Cys Asp He Gly Pro He 
225 230 235 240 

Ser Gly Gly Cys Phe Thr Ala He Val Ala Pro Asp Gly Thr Leu Leu 

245 250 255 

Gly Glu Pro He Arg Ser Gly Glu Gly Met Val He Val Asp Leu Asp 

260 265 270 

Phe Thr Leu He Asp Lys Arg Lys Gin Val Met Asp Ser Arg Gly His 

275 280 285 

Tyr Asn Arg Pro Glu Leu Leu Ser Leu Leu He Asp Arg Thr Pro Thr 

290 295 300 

Ala His Val His Asp Arg Ala Val Arg Pro Glu Ser Ala Ala Glu Gin 
305 310 315 320 

Arg Ser Glu Glu Leu Leu Ala Thr Ala Val 
325 330 

<210> 175 
<211> 945 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 175 

atgaccacct acgggcagtt tagacttgcc gcgatcaatg ccgcgcctgt ctacttcgac 60 
agggaagcat ccaccgaaaa agcttgccgg ctcattctcg aagcgggggc atcaggggcg 120 
acgctggcgg cgtttggcga gacgtggctt cccggctatc cattccacat ctggagggaa 



180 



gttccgactg ctctccgagt ggaatacatc gctaatgccg tcgagattcc cagcccaacg 240 
accgaccgat tgtgcgcggc ggctcgtcag gcgaacatcg atgttgtgat cggcgttgtc 300 
gaactggatg cgcagacaca cgggacggtc tactgtacgc tcttgttcat tggcagcgat 360 
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ggctcaattc 
ggggaaggtg 
ctgtgttgct 
cagattcata 
gcattcgctt 
gaccttcggg 
atcaacccag 
gattgctcaa 
tctcgcccgg 
acgaacaacc 



tgggacgtca tcgaaagatt aaaccgactt tcgtggagcg aaccgcatgg 
acggcagcag cctgatcgtc tacgagcgcc cgtatggcaa gatcagtggt 
gggaacacaa tatggttctg ccgggctacg cgctgatggc gcaggggacg 
tcgccgcatg gcccggctgg gaaagcactc gccatctgct cttatcaaga 
ctcaggcagc ggcgtatgtg attgatgtag gcgctatcgt caatcgtgac 
aagattacca ggctttgatt gctggaagct actggggcgg aagttgcatc 
aaggcgaggt catcgctggt ccagcgaaat cggagaccat tctggttgca 
ccgagcagat ctttagctca aaagtgctct gtgatgtggg cgggcattat 
atatttttca gctccatgtc aatcgaaagc catatcaacg tatcgtcgag 
cacaccccgc tccgattgag ttcgattacc gttga 



420 
480 
540 
600 
660 
720 
780 
840 
900 
945 



<210> 176 
<211> 314 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 176 

Met Thr Thr Tyr Gly Gin Phe Arg Leu Ala Ala lie Asn Ala Ala Pro 

15 10 15 

Val Tyr Phe Asp Arg Glu Ala Ser Thr Glu Lys Ala Cys Arg Leu He 

20 25 30 

Leu Glu Ala Gly Ala Ser Gly Ala Thr Leu Ala Ala Phe Gly Glu Thr 

35 40 45 

Trp Leu Pro Gly Tyr Pro Phe His He Trp Arg Glu Val Pro Thr Ala 

50 55 60 

Leu Arg Val Glu Tyr He Ala Asn Ala Val Glu He Pro Ser Pro Thr 
65 70 75 80 

Thr Asp Arg Leu Cys Ala Ala Ala Arg Gin Ala Asn He Asp Val Val 

85 90 95 

He Gly Val Val Glu Leu Asp Ala Gin Thr His Gly Thr Val Tyr Cys 

100 105 HO 

Thr Leu Leu Phe He Gly Ser Asp Gly Ser He Leu Gly Arg His Arg 

115 120 125 

Lys He Lys Pro Thr Phe Val Glu Arg Thr Ala Trp Gly Glu Gly Asp 

130 135 140 

Gly Ser Ser Leu He Val Tyr Glu Arg Pro Tyr Gly Lys He Ser Gly 
145 150 155 160 

Leu Cys Cys Trp Glu His Asn Met Val Leu Pro Gly Tyr Ala Leu Met 

165 170 175 

Ala Gin Gly Thr Gin He His He Ala Ala Trp Pro Gly Trp Glu Ser 

180 185 190 

Thr Arg His Leu Leu Leu Ser Arg Ala Phe Ala Ser Gin Ala Ala Ala 

195 200 205 

Tyr Val He Asp Val Gly Ala He Val Asn Arg Asp Asp Leu Arg Glu 

210 215 - 220 

Asp Tyr Gin Ala Leu He Ala Gly Ser Tyr Trp Gly Gly Ser Cys He 
225 230 235 240 

He Asn Pro Glu Gly Glu Val He Ala Gly Pro Ala Lys Ser Glu Thr 

245 250 255 

He Leu Val Ala Asp Cys Ser Thr Glu Gin He Phe Ser Ser Lys Val 

260 265 270 

Leu Cys Asp Val Gly Gly His Tyr Ser Arg Pro Asp He Phe Gin Leu 

275 280 285 

His Val Asn Arg Lys Pro Tyr Gin Arg He Val Glu Thr Asn Aan Pro 

290 295 300 

His Pro Ala Pro He Glu Phe Asp Tyr Arg 
305 310 
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<210> 177 
<211> 948 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 177 

atggactcac 

acgttgtcaa 

gtgtttggtg 

gtcttcaatt 

atcgaagcag 

gtcgcgggca 

gtgtatatcg 

gaagaacgac 

gcttttacgg 

ctgtatgcgc 

caggatatta 

ttgatgcgca 

gcacccgatc 

gttgaaccgg 

cgtgaagaac 

ttgagagtca 



ttaccgttgg 
agatgttgga 
aggcgttgct 
cgccgatgca 
ggcatcttga 
tcatcgagcg 
atttgaacgg 
tcacctggtc 
ttggcaaact 
aaggcgaaga 
cgcgctttat 
agagtgactt 
cactcgccaa 
ttgcggatga 
gccagaactt 
accgccagcg 



tctcgcgcaa 
acaggttcga 
ccccggttat 
aaaggaaatc 
tccattgtgc 
tccgttggat 
tgtcatccaa 
gcctggcgat 
caattgttgg 
tctgcacgtt 
cgcaatggag 
cccacaagat 
cggaggttcg 
agagaagttg 
tgatccatcc 
acaaagcgtt 



atcgcgcccg 
gcggcgaaag 
ccattttgga 
cacgcgcact 
ggcgcggcaa 
cgcggcggac 
tcggtgcatc 
ggtcatgggt 
gaaaactgga 
gctgtctggc 
tcgcgatcgt 
acgcctcatc 
tgtctggctg 
atcgtcgcca 
ggtcattaca 
atcgcttttg 



tttggttgaa 
aggcgggctg 
tcgaactgac 
acatggatca 
aagcgcacgg 
atagtttata 
gcaaactgat 
tacgcgtgca 
tgccgctgcc 
ccgggtccgt 
ttgtcgtttc 
tctccgccat 
gacctgacgg 
ccattgacca 
gccgaccaga 
atgagtag 



tcgggcgggg 
tcagcttgtt 
gaacggcgca 
agctgtgcag 
catcaccgtg 
tgcgagtctg 
gcccacctat 
tacactgggc 
gcgcgcggct 
gcgcaacaca 
ggtttcgagt 
tcttgaatct 
taaatggatc 
tgcccgtgta 
tgtgacacaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
948 



<210> 178 
<211> 315 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 178 












Met Asp 


Ser 


Leu 


Thr 


Val 


Gly 


Leu 


1 






5 








Asn Arg 


Ala 


Gly 
20 


Thr 


Leu 


Ser 


Lys 


Lys Glu 


Ala 
35 


Gly 


Cys 


Gin 


Leu 


Val 
40 


Gly Tyr 


Pro 


Phe 


Trp 


He 


Glu 


Leu 


50 










55 




Pro Met 


Gin 


Lys 


Glu 


He 


His 


Ala 


65 








70 






lie Glu 


Ala 


Gly 


His 
85 


Leu 


Asp 


Pro 


Gly lie 


Thr 


Val 
100 


Val 


Ala 


Gly 


He 


Gly His 


Ser 


Leu 


Tyr 


Ala 


Ser 


Leu 


115 










120 


He Gin 


Ser 


Val 


His Arg 


Lys 


Leu 


130 










135 




Thr Trp 


Ser 


Pro 


Gly Asp 


Gly 


His 


145 








150 






Ala Phe 


Thr 


Val 


Gly Lys 


Leu 


Asn 








165 








Pro Arg 


Ala 


Ala 
180 


Leu 


Tyr 


Ala 


Gin 


Trp Pro 


Gly 
195 


Ser 


Val 


Arg 


Asn 


Thr 
200 



Ala 


Gin 
10* 


He Ala 


Pro 


Val 


Trp 
15 


Leu 


Met 


Leu 


Glu Gin 


Val 


Arg 


Ala 


Ala 


25 








30 






Val 


Phe 


Gly Glu 


Ala 
45 


Leu 


Leu 


Pro 


Thr 


Asn 


Gly Ala 
60 


Val 


Phe 


Asn 


Ser 


His 


Tyr 


Met Asp 


Gin 


Ala 


Val 


Gin 




75 








80 


Leu 


Cys 
90 


Gly Ala 


Ala 


Lys 


Ala 
95 


His 


He 


Glu 


Arg Pro 


Leu 


Asp 


Arg 


Gly 


105 








110 






Val 


Tyr 


He Asp 


Leu 
125 


Asn 


Gly 


Val 


Met 


Pro 


Thr Tyr 
140 


Glu 


Glu 


Arg 


Leu 


Gly Leu 


Arg Val 


His 


Thr 


Leu 


Gly 






155 








160 


Cys 


Trp 


Glu Asn 


Trp 


Met 


Pro 


Leu 


170 








175 




Gly Glu 


Asp Leu 


His 


Val 


Ala 


Val 


185 








190 






Gin 


Asp 


He Thr 


Arg 


Phe 


He 


Ala 



205 
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Met laxU 


Ser 


Arg 


Ser 


Phe Val 


\Ts 1 Car \7a 1 Cor 

Vol OCX vox Oct 


OCI 


Leu 




Lys 










215 




C.£.\i 








Ser Asp 


Phe 


Pro 


Gin Asp Thr 


irlO tlx S iicu Ocx 


Ax a 


Tl a 

lie 




Ser 










230 












Ala Pro 


Asp 


Pro 


Leu 


Ala Asn 


Gly Gly Ser Cys 


Leu 


Ala 


Gly Pro 


Asp 








245 




250 






255 




Gly Lys 


Trp 


He 


Val 


Glu Pro 


Val Ala Asp Glu 


Glu 


Lys 


Leu He 


Val 






260 






265 






270 




Ala Thr 


He 


Asp 


His 


Ala Arg 


Val Arg Glu Glu 


Arg 


Gin 


Asn Phe 


Asp 




275 








280 




285 






Pro Ser 


Gly His 


Tyr 


Ser Arg 


Pro Asp Val Thr 


Gin 


Leu 


Arg Val 


Asn 


290 








295 




300 








Arg Gin 


Arg 


Gin 


Ser 


Val He 


Ala Phe Asp Glu 










305 








310 


315 











<210> 179 
<211> 915 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 
<400> 179 

atgaccacca aagcagecat cattcaggcg cgccccatat actacgatct ggcggcgtgt 60 

gtcgataagg cgcttgccct catcaccgag gcggcggcac gcggcgcgaa catcgtcacg 120 

cteggegaga cgtggctgcc gggctatccc gcgtggctgg atgtgtgcgt cgagatgggg 180 

ctgtgggatc acgcgccgac caaagccgtc tttcagegge tccatgccaa cagcgtcacc 240 

atccccggcg eggagatcag ccagttctgc gacatcgccc gccgccttag catcgtgctg 300 

gtgctcagcg teaacgageg cgtccgcaac accttgttca acaccctgct cacgattgac 360 

gagegeggeg acatccgcaa ccaccaccgc aagctgatgc cgacctacac tgagegcate 420 

gtctgggggc agggegaegg cgegggctta caggeggteg agaeggcaac cgggcgcgtc 4 80 

ggcgggctga tctgctggga acactggatg ccgctggcac ggcaggeget gcacaacgcc 540 

ggggagcaaa ttcacgtttc ggtcttcccg accgtcaacg acccgcgcca ccaagtcgcc 600 

agccgccagt aegctttega ggggcgctgc ttcgtgctga ccgccggcag catccagcgc 660 

gccgacgacc taccgccgga actgaccgtc aaggegggea tcgcgccgga tgatctggtg 720 

cagggeggeg gcagcgccat catcgcgccg gaeatgeget acctcgccgg accctgcttc 780 

gacgaggaaa ccatcctcta cgccgacctc gaectgageg agacgatccg cgagagcatg 840 

acgctggacg tgagegggea ttactcgcgc cccgacgtgt tcaccttcga ggttaatcgg 900 

cageggaaaa tttag 915 

<210> 180 
<211> 304 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 180 



Met 


Thr 


Thr 


Lys 


Ala 


Ala 


He 


He 


Gin Ala Arg 


Pro 


He 


Tyr 


Tyr 


Asp 


1 








5 








10 








15 




Leu 


Ala 


Ala 


Cys 


Val 


Asp 


Lys 


Ala 


Leu Ala Leu 


He 


Thr 


Glu 


Ala 


Ala 








20 










25 






30 






Ala 


Arg 


Gly 


Ala 


Asn 


He 


Val 


Thr 


Leu Gly Glu 


Thr 


Trp 


Leu 


Pro 


Gly 






35 










40 






45 








Tyr 


Pro 


Ala 


Trp 


Leu 


Asp 


Val 


Cys 


Val Glu Met 


Gly 


Leu 


Trp 


Asp 


His 




50 










55 






60 










Ala 


Pro 


Thr 


Lys 


Ala 


Val 


Phe 


Gin 


Arg Leu His 


Ala 


Asn 


Ser 


Val 


Thr 


65 










70 






75 










80 


He 


Pro 


Gly 


Ala 


Glu 


He 


Ser 


Gin 


Phe Cys Asp 


He 


Ala 


Arg 


Arg 


Leu 
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85 90 95 

Ser lie Val Leu Val Leu Ser Val Asn Glu Arg Val Arg Asn Thr Leu 

100 105 110 

Phe Asn Thr Leu Leu Thr lie Asp Glu Arg Gly Asp lie Arg Asn His 

115 120 125 

His Arg Lys Leu Met Pro Thr Tyr Thr Glu Arg He Val Trp Gly Gin 

130 135 140 

Gly Asp Gly Ala Gly Leu Gin Ala Val Glu Thr Ala Thr Gly Arg Val 
145 150 155 160 

Gly Gly Leu He Cys Trp Glu His Trp Met Pro Leu Ala Arg Gin Ala 

165 170 175 

Leu His Asn Ala Gly Glu Gin He His Val Ser Val Phe Pro Thr Val 

180 185 190 

Asn Asp Pro Arg His Gin Val Ala Ser Arg Gin Tyr Ala Phe Glu Gly 

195 200 205 

Arg Cys Phe Val Leu Thr Ala Gly Ser He Gin Arg Ala Asp Asp Leu 

210 215 220 

Pro Pro Glu Leu Thr Val Lys Ala Gly He Ala Pro Asp Asp Leu Val 
225 230 235 240 

Gin Gly Gly Gly Ser Ala He He Ala Pro Asp Met Arg Tyr Leu Ala 

245 250 255 

Gly Pro Cys Phe Asp Glu Glu Thr He Leu Tyr Ala Asp Leu Asp Leu 

260 265 270 

Ser Glu Thr He Arg Glu Ser Met Thr Leu Asp Val Ser Gly His Tyr 

275 280 285 

Ser Arg Pro Asp Val Phe Thr Phe Glu Val Asn Arg Gin Arg Lys He 
290 295 300 

<210> 181 

<211> 990 

<212> DNA 

<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 181 

atgcccaaga ccgtacgcgc agccgcagtc cagatcgcgc ccgacctgac gtcacgcgct 60 

ggcaccgtcg agcgggtcct gaatgcgatt gccgaagcct ctgacaaagg cgcggagctg 120 

attgtatttc cagaaacctt tgtgccttgg tatccgtatt tcagcttcgt tctgccgcct 180 

gtccagcagg gacccgagca cctgcggctt tatgaagaag cggtgacggt accatcagca 240 

gaaacacggg ccgtcgccga cgccgcgcgc aaacgcaatg cggtcatcgt ccttggcgtc 300 

aatgagcgcg accacggctc gctctacaac acccagctga tcttcgacgc ggatggcagc 360 

ctgaaactca agcgccgcaa gatcacgccc acctatcacg agcggatgat ctggggacag 4 20 

ggcgatggcg ccggtctaaa agtggtcgaa actgccatcg gccgcatggg cgcattggcg 4 80 

tgctgggagc actacaaccc cctcgcccga tacgcgctga tggctcagca tgaggaaatt 54 0 

cacgcctctc attttccggg ctcactggtc ggcccgatat tcggcgagca gatcgaagtc 600 

acgatgcgcc accacgcgtt ggaatcgggc tgtttcgtgg tcaatgccac cggctggtta 660 

agcgaggagc agatcgcgtc cattcaccca gatcccagcc tgcagaaggg tcttcgagat 720 

ggctgcatga cctgcatcat aaccccggaa ggccgccacg tcgttcctcc tctgacatcg 780 

ggtgaaggaa tcctgattgg cgacctggac atgcggctca tcaccaagcg caagcgaatg QAn 
atggattccg tcggacacta tgcacgtcct gagctgctgc accttgtcca tgacacgacg 
cccgcacgcg cacgcgagca ggtgggcctt tcaggcgatt tttccgatgc agggcaagac 
aagctatttg aggaggttca agatgcgtga 

<210> 182 
<211> 329 
<212> PRT 
<213> Unknown 

<220> 



840 
900 
960 
990 
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<223> Obtained from an environmental sample 



<400> 182 



Met 


Pro 


Lys 


Thr Val Arg 


Ala Ala Ala Val 


Gin He Ala Pro Asp 


Leu 


1 






5 


10 


15 




Thr 


Ser 


Arg 


Ala Gly Thr 


Val Glu Arg Val 


Leu Asn Ala He Ala 


Glu 








20 




25 


30 




Ala 


Ser 


Asp 


Lys 


Gly Ala 


Glu Leu He Val 


Phe Pro Glu Thr Phe 


Val 






35 






40 


45 




Pro 


Trp 


Tyr 


Pro 


Tyr Phe 


Ser Phe Val Leu 


Pro Pro Val Gin Gin 


Gly 




50 








55 


60 




Pro 


Glu 


His 


Leu Arg Leu 


Tyr Glu Glu Ala 


Val Thr Val Pro Ser 


Ala 


65 








70 




75 


80 


Glu 


Thr 


Arg 


Ala 


Val Ala 


Asp Ala Ala Arg 


Lys Arg Asn Ala Val 


He 










85 


90 


95 




Val 


Leu Gly 


Val 


Asn Glu 


Arg Asp His Gly 


Ser Leu Tyr Asn Thr 


Gin 








100 




105 


110 




Leu 


He 


Phe 


Asp Ala Asp 


Gly Ser Leu Lys 


Leu Lys Arg Arg Lys 


He 






115 






120 


125 




Thr 


Pro 


Thr 


Tyr 


His Glu 


Arg Met He Trp 


Gly Gin Gly Asp Gly 


Ala 




130 








135 


140 




Glv 


Leu 


Lys 


Val 


Val Glu 


Thr Ala He Gly 


Arg Met Gly Ala Leu 


Ala 


145 








150 




155 


160 


Cvs 


Trp 


Glu 


His 


Tyr Asn 


Pro Leu Ala Arg 


Tyr Ala Leu Met Ala 


Gin 










165 


170 


175 




His 


Glu 


Glu 


He 


His Ala 


Ser His Phe Pro 


Gly Ser Leu Val Gly 


Pro 








180 




185 


190 




He 


Phe Gly 


Glu 


Gin He 


Glu Val Thr Met 


Arg His His Ala Leu 


Glu 






195 






200 


205 




Ser 


Gly Cys 


Phe 


Val Val 


Asn Ala Thr Gly 


Trp Leu Ser Glu Glu 


Gin 




210 








215 


220 




He 


Ala 


Ser 


He 


His Pro 


Asp Pro Ser Leu 


Gin Lys Gly Leu Arg 


Asp 


225 








230 




235 


240 


Gly 


Cys Met 


Thr Cys He 


He Thr Pro Glu 


Gly Arg His Val Val 


Pro 










245 


250 


255 




Pro 


Leu 


Thr 


Ser Gly Glu 


Gly He Leu He 


Gly Asp Leu Asp Met 


Arg 








260 




265 


270 




Leu 


He 


Thr 


Lys Arg Lys 


Arg Met Met Asp 


Ser Val Gly His Tyr 


Ala 






275 






280 


285 




Arg 


Pro 


Glu 


Leu 


Leu His 


Leu Val His Asp 


Thr Thr Pro Ala Arg 


Ala 




290 








295 


300 




Arg 


Glu 


Gin 


Val Gly Leu 


Ser Gly Asp Phe 


Ser Asp Ala Gly Gin 


Asp 


305 








310 




315 


320 


Lys 


Leu 


Phe 


Glu 


Glu Val 


Gin Asp Ala 














325 









<210> 183 
<211> 1002 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 183 

atggctgagt. cacgcattat tcgtgccgcc gccgcccaga ttgcgccaga tctccatgag 60 

gccagcaaaa cgctggcccg ggtgctggac gcgatcgatc aggcagccgc acagggggca 120 

gagatcatcg tctttcccga gacctttgtg ccttattacc cctacttctc gtttatcacg 180 

cccgcgatga ccgccggagc ggcccatctg aaattgtatg accaggcggt ggtggtgccc 240 

ggcccgatca cccatgcggt gggcgaacgc gcccgcctgc gcaacatcgt cgtggtgctg 300 

ggggtgaatg aacgtgacca cggcacgctc tacaacaccc aactggtatt tgatgccagc 360 
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ggggaactgg tgctgaaacg ccgcaaaatc accccgacct atcacgaacg gatgatctgg 420 

ggacagggag acggtgccgg attaaaggtg gtggactcgg cggttgggcg catcggggct 480 

ttagcctgct gggagcacta caacccactg gcgcgctaca gcctgatgac tcagcacgag 540 

gagatccatt gcagccagtt ccctggttca ctggtggggc cgatttttgc cgagcagatg 600 

gacgtcacca ttcgccatca tgcactggag tccggttgct ttgtcatcaa tgccaccggc 660 

tggctgaccg aggagcagat caacgagctg accagcgacc cggcgttaca aaaggggctg 720 

cgtggtggct gcaacaccgc catcatctcg ccggaaggcc gccatctggt gccgccactg 7 80 

accgaaggtg aggggatttt gattgccgat ctggacatgg ccctgatcac caaacgcaaa 84 0 

cgcatgatgg attctgtcgg ccactatgcc cgaccggaat tactcagcct gcgcctcgat 900 

gcgacgcctg cccgttatgt ggtggcgcgt gataatgagt ccgaaaccgg aggaggcaac 960 

gatgcagaac gtaccgtcta cgcgccagca gctgatcact ga 1002 

<210> 184 
<211> 333 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 184 

Met Ala Glu Ser Arg He He Arg Ala Ala Ala Ala Gin He Ala Pro 

15 10 15 

Asp Leu His Glu Ala Ser Lys Thr Leu Ala Arg Val Leu Asp Ala He 

20 25 30 

Asp Gin Ala Ala Ala Gin Gly Ala Glu He lie Val Phe Pro Glu Thr 

35 40 45 

Phe Val Pro Tyr Tyr Pro Tyr Phe Ser Phe He Thr Pro Ala Met Thr 

50 55 60 

Ala Gly Ala Ala His Leu Lys Leu Tyr Asp Gin Ala Val Val Val Pro 
65 70 75 80 

Gly Pro He Thr His Ala Val Gly Glu Arg Ala Arg Leu Arg Asn He 

85 90 95 

Val Val Val Leu Gly Val Asn Glu Arg Asp His Gly Thr Leu Tyr Asn 

100 105 HO 

Thr Gin Leu Val Phe Asp Ala Ser Gly Glu Leu Val Leu Lys Arg Arg 

. 115 120 125 

Lys He Thr Pro Thr Tyr His Glu Arg Met He Trp Gly Gin Gly Asp 

130 135 140 

Gly Ala Gly Leu Lys Val Val Asp Ser Ala Val Gly Arg He Gly Ala 
145 150 155 160 

Leu Ala Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ser Leu Met 

165 170 175 

Thr Gin His Glu Glu He His Cys Ser Gin Phe Pro Gly Ser Leu Val 

180 185 190 

Gly Pro He Phe Ala Glu Gin Met Asp Val Thr He Arg His His Ala 

195 200 205 

Leu Glu Ser Gly Cys Phe Val He Asn Ala Thr Gly Trp Leu Thr Glu 

210 215 220 

Glu Gin He Asn Glu Leu Thr Ser Asp Pro Ala Leu Gin Lys Gly Leu 
225 230 235 240 

Arg Gly Gly Cys Asn Thr Ala He He Ser Pro Glu Gly Arg His Leu 

245 250 255 

Val Pro Pro Leu Thr Glu Gly Glu Gly He Leu He Ala Asp Leu Asp 

260 265 270 

Met Ala Leu He Thr Lys Arg Lys Arg Met Met Asp Ser Val Gly His 

275 280 285 

Tyr Ala Arg Pro Glu Leu Leu Ser Leu Arg Leu Asp Ala Thr Pro Ala 

290 295 300 

Arg Tyr Val Val Ala Arg Asp Asn Glu Ser Glu Thr Gly Gly Gly Asn 
305 310 315 320 
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Asp Ala Glu Arg Thr Val Tyr Ala Pro Ala Ala Asp His 
325 330 

<210> 185 
<211> 1017 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 185 

atgggcattg aacatccgaa atacaaggtc gcggtggtgc aggcggcccc cgcctggctc 60 

gatctcgacg gctcggtcga taagtcgatc gcgctgatca aggaggcggc cgagaagggg 120 

gcgaagctga tcgcctttcc cgaggccttc atccccggtt acccctggca tatctggatg 180 

gactcgccgg cctgggcgat cggccgcggc ttcgtgcagc gttatttcga caattcgctg 240 

tcctatgaca gtccgcaggc cgagcggctg cgcgatgcgg tgaagaaggc gaagctcacc 300 

gccgtgttcg gactgtccga gcgcgacggc ggcagcctct acctcgcgca atggctgatc 360 

gggcccgatg gcgagaccat cgccaagcgc cgcaagctgc ggccgaccca cgccgaacgt 420 

accgtctatg gcgaaggcga cggcagcgat cttgccgtgc atgcgcgcgc cgacatcggc 480 

cggatcggcg cgctctgctg ctgggagcat ctgcagccac tgtcgaaata cgcgatgtac 54 0 

gcccagaacg aacaggtcca tgtcgcagcc tggcccagct tctcgctgta cgaccccttc 600 

gcgccggcgt taggggccga ggtcaacaac gcggcctccc gcgtctatgc ggtggaaggc 660 

tcctgcttcg tgctcgcgcc gtgcgcgacg gtgtcgcagg cgatgatcga cgagctctgc 720 

gaccggcccg acaagaacgc gctgctgcac gtcggcggcg gctttgccgc gatctatggc 780 

cccgacggca gccagatcgg cgacaagctg gcgccggacc aggaggggct gctgatcgcc 840 

gagatcgacc ttggcgccat cggtgtcgcc aagaacgccg ccgatcccgc cgggcactat 900 

tcgcgtcccg acgtgacgcg gttgctgctc aacaagaagc gataccagcg cgtcgagcag 960 

ttcgcgctgc cggtcgacac cgtcgagccg gcggatatcg gcgcagcggc gagctga 1017 

<210> 186 
<211> 338 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 186 



Met 


Gly 


He 


Glu 


His 


Pro 


Lys 


Tyr 


Lys 


Val 


Ala Val Val 


Gin 


Ala 


Ala 


1 






5 










10 






15 




Pro 


Ala 


Trp 


Leu 


Asp 


Leu 


Asp 


Gly 


Ser 


Val 


Asp Lys Ser 


He 


Ala 


Leu 






20 










25 






30 






He 


Lys 


Glu 


Ala 


Ala 


Glu 


Lys 


Gly 


Ala 


Lys 


Leu He Ala 


Phe 


Pro 


Glu 




35 










40 






45 ■ 








Ala 


Phe 
50 


He 


Pro 


Gly 


Tyr 


Pro 
55 


Trp 


His 


He 


Trp Met Asp 
60 


Ser 


Pro 


Ala 


Trp 


Ala 


He 


Gly 


Arg 


Gly 


Phe 


Val 


Gin 


Arg 


Tyr Phe Asp 


Asn 


Ser 


Leu 


65 






70 










75 






80 


Ser 


Tyr 


Asp 


Ser 


Pro 


Gin 


Ala 


Glu 


Arg 


Leu 


Arg Asp Ala 


Val 


Lys 


Lys 






85 










90 






95 




Ala 


Lys 


Leu 


Thr 


Ala 


Val 


Phe 


Gly 


Leu 


Ser 


Glu Arg Asp 


Gly Gly 


Ser 






100 










105 






110 






Leu 


Tyr 


Leu 


Ala 


Gin 


Trp 


Leu 


He 


Gly 


Pro 


Asp Gly Glu 


Thr 


He 


Ala 




115 










120 






125 








Lys 


Arg 
130 


Arg 


Lys 


Leu 


Arg 


Pro 
135 


Thr 


His 


Ala 


Glu Arg Thr 
140 


Val 


Tyr 


Gly 


Glu 


Gly 


Asp 


Gly 


Ser 


Asp 


Leu 


Ala 


Val 


His 


Ala Arg Ala 


Asp 


He 


Gly 


145 






150 










155 






160 


Arg 


He 


Gly 


Ala 


Leu 


Cys 


Cys 


Trp 


Glu 


His 


Leu Gin Pro 


Leu 


Ser 


Lys 








165 










170 






175 
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Tyr Ala Met 


Tyr 


Ala Gin 


Asn 


Glu 


bxn 


vaj. 


U A a Val Ilia ft 1 A T T"0 
nlS Val nla nlo * *•¥ 


Pro 








180 








1 

lu J 




190 




Ser 


Phe 


Ser 


Leu 


Tyr Asp 


Pro 


Phe 


Ala 


Pro 


nla JjCU VjI y nia va 


Val 






195 






200 






205 




Asn 


Asn 


Ala 


Ala 


Ser Arg 


Val 


Tyr 


Ala 


Val 


biu vsXy ocx ^yo rue 


Val 




210 








215 












Leu 


Ala 


Pro 


Cys 


Ala Thr 


Val 


Ser 


Gin 


Ala 


Met i-Le Asp uj.u jjeu 


cys 


225 






230 












£.H\J 


Asp 


Arg 


Pro 


Asp Lys Asn 


Ala 


Leu 


Leu 


His 


Val v3ly u±y O-L nic 


Ala 






245 








250 


255 




Ala 


lie 


Tyr 


Gly 


Pro Asp Gly Ser 


Gin 


He 


Gly Asp Lys Leu Ala 


Pro 






260 








265 




270 




Asp 


Gin 


Glu 


Gly 


Leu Leu 


He 


Ala 


Glu 


He 


Asp Leu Gly Ala He 


Gly 




275 






280 






285 




Val 


Ala 


Lys 


Asn 


Ala Ala Asp 


Pro 


Ala Gly 


His Tyr Ser Arg Pro 


Asp 




290 






295 








300 




Val 


Thr Arg 


Leu 


Leu Leu 


Asn 


Lys 


Lys Arg 


Tyr Gin Arg Val Glu 


Gin 


305 








310 










315 


320 


Phe 


Ala 


Leu 


Pro 


Val Asp 


Thr 


Val 


Glu 


Pro 


Ala Asp He Gly Ala 


Ala 










325 








330 


335 




Ala 


Ser 





















<210> 187 
<211> 1059 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 187 

atgggcatca atcatcccaa gtacaaagtg gccgtcgtgc aagcggcacc tgtctggctc 60 

gacctggatg gaacagtcga caagtgeatt eggctgatag gcgaggccgc tgagaaggga 120 

tgeaagctea ttgeatttec egagaegtte ateceggggt acccctggca catctggatg 180 

ggagctcegg cctggacgat egggegegga ttcgtgcagc gatacttcga caattcgett 240 

gegtacgaca gtccgcaggc aaacaagctt cgcgccgcgg tgaagegege cggagtgacg 300 

gcagttctcg gcttgtcgga gcgccgcgga ggctccctgt acatcgccca gtggctcatc 360 

ggacctgatg gcgagaccat cgctcaacgg egaaagctge gccccaccca tgeggagege 420 

accgtcttcg gegagggega tggcagegat ttggcggtgc acagccgccc cgacatcggc 480 

cgactgggtg ccctttgctg ctgggaacat ctccagcctt tgaccaagta cgcgatgtac 540 

gcgcaagacg ageaagtgea cgtcgctgca tggecgaget tctcgatgta cgagcctttc 600 

gcgcacgccc tggggtggga gacgaacaac gcggtgagca aggtgtacgc ggtcgaaggt 660 

tcgtgctacg tcctggcccc ctgcgccatc atctctcagg cgatggtgga cgaactcgtc 720 

gaeagegagg acaagaagee gctggttcat geeggegggg ggcatgcggt gatctatggt 780 

cccgatggca ccctgcttac tcccaagctt gcagaagacg aggagggect actgatcgeg 840 

gagatcgatc tgggggcaat cggggtcgcc aagaacgegg cagaccccgc cggccactac 900 

tcgcggcccg atgtcacccg cctgctcttc aacaaccggc cggccaagcg cgtggagacg 960 

atgctgctcc cggtcgacgc ggcagaagtc gtggagccgg eggaeggage getcaatgeg 1020 

tccgagggac gccagcgaca gttcaagctg cccgcctag 105$ 

<210> 188 

<211> 352 

<212> PRT 

<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 188 

Met Gly He Asn His Pro Lys Tyr Lys Val Ala Val Val Gin Ala Ala 



Page 123 



WO 03/097810 PCT/US03/15712 



1 






5 








10 


15 




Pro 


Val 


Trp 


Leu Asp Leu 


Asp Gly 


Thr 


Val 


Asp Lys Cys He Arg 


Leu 








20 






25 




30 




lie 


Gly Glu 


Ala Ala Glu 


Lys 


Gly 


Cys 


Lys 


Leu He Ala Phe Pro 


Glu 






35 






40 






45 




Thr 


Phe 


He 


Pro Gly Tyr 


Pro 


Trp 


His 


He 


Trp Met Gly Ala Pro 


Ala 




50 






55 








60 




Trp 


Thr 


He 


Gly Arg Gly 


Phe 


Val 


Gin 


Arg 


Tyr Phe Asp Asn Ser 


Leu 


65 






70 










75 


80 


Ala 


Tyr Asp 


Ser Pro Gin 


Ala 


Asn 


Lys 


Leu 


Arg Ala Ala Val Lys 


Arg 








85 








90 


95 




Ala Gly Val 


Thr Ala Val 


Leu 


Gly 


Leu 


Ser 


Glu Arg Arg Gly Gly 


Ser 








100 






105 




110 




Leu 


Tyr 


He 


Ala Gin Trp Leu 


He 


Gly Pro 


Asp Gly Glu Thr He 


Ala 






115 






120 






125 




Gin Arg Arg 


Lys Leu Arg 


Pro 


Thr 


His 


Ala 


Glu Arg Thr Val Phe 


Gly 




130 






135 








140 




Glu Gly Asp 


Gly Ser Asp Leu Ala 


Val 


His 


Ser Arg Pro Asp He 


Gly 


145 






150 










155 


160 


Arg Leu Gly 


Ala Leu Cys 


Cys 


Trp 


Glu 


His 


Leu Gin Pro Leu Thr 


Lys 








165 








170 


175 




Tyr Ala Met 


Tyr Ala Gin Asp Glu 


Gin 


Val 


His Val Ala Ala Trp 


Pro 








180 






185 




190 




Ser 


Phe 


Ser 


Met Tyr Glu 


Pro 


Phe 


Ala 


His 


Ala Leu Gly Trp Glu 


Thr 






195 






200 






205 




Asn 


Asn 


Ala 


Val Ser Lys 


Val 


Tyr 


Ala 


Val 


Glu Gly Ser Cys Tyr 


Val 




210 






215 








220 




Leu 


Ala 


Pro 


Cys Ala He 


He 


Ser 


Gin 


Ala 


Met Val Asp Glu Leu 


Val 


225 






230 










235 


240 


Asp 


Ser 


Glu 


Asp Lys Lys 


Pro 


Leu 


Val 


His 


Ala Gly Gly Gly His 


Ala 








245 








250 


255 




Val 


He 


Tyr 


Gly Pro Asp 


Gly 


Thr 


Leu 


Leu 


Thr Pro Lys Leu Ala 


Glu 








260 






265 




270 




Asp 


Glu 


Glu 


Gly Leu Leu 


He 


Ala 


Glu 


He 


Asp Leu Gly Ala He 


Gly 






275 






280 






285 




Val 


Ala 


Lys 


Asn Ala Ala Asp 


Pro 


Ala Gly 


His Tyr Ser Arg Pro 


Asp 




290 






295 








300 




Val 


Thr Arg 


Leu Leu Phe 


Asn 


Asn 


Arg 


Pro 


Ala Lys Arg Val Glu 


Thr 


305 






310 










315 


320 


Met 


Leu 


Leu 


Pro Val Asp Ala Ala 


Glu 


Val 


Val Glu Pro Ala Asp 


Gly 








325 








330 


335 




Ala 


Leu 


Asn 


Ala Ser Glu 


Gly Arg 


Gin Arg 


Gin Phe Lys Leu Pro 


Ala 








340 






345 




350 





<210> 189 
<211> 1005 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 189 

atgcaagaca cgaaattcaa agttgcagtc gtccaggccg cgccggtatt catggatgcg 60 

ccagcctccg tggccaaggc gatcggtttc atccaggagg cgggcgcagc cggggcgaag 120 

ctgctggcgt tcccggaggt ctggattccg ggctaccctt ggtggctttg gctcgggacg 180 

ccggcgtggg gaatgcagtt tgtgccgcgc tatcacgcca attcgctgcg tgctgatgga 240 

cccgaaatcc tcgctctttg tgcggccgcc gccgaagcga agatcaacgt cgtgatgggc 300 

ttctccgaaa tcgacggagg aacgctctac ctaagtcagg ttttcatcag cgatgcgggc 360 

aagatcatct tcaagcgccg aaagctcaag ccgacccacg tcgaacgtac gctatttggt 420 

gaaggagatg ggtctgattt ccgagtcgtc gacagcagcg tcgggcgcct cggagccctg 4 80 
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tgctgtgccg aacacattca gccgttgtcg aaatacgcca tgtacgcgat gaacgagcaa 
attcatgtgg cgtcgtggcc atctttcacg ctctatcgcg gcaaagccta cgctttgggt 
catgaggtga atcttgccgc cagccaaatc tacgcgctcg aaggaggttg cttcgtcttg 
catgccacgg caattaccgg tcaggatatg ttcgacatgc tttgcgacac tccggaaagg 
gcggatttgc tgaatgcgga gggagcaaag ccgggtggag gctattcgat gatttttggt 
cccgatggtc agccgatgtg cgagcatctg ccgcaggaca aggaaggcat cctctatgcc 
ggcgtagacc tgtcgatgat tgcgatcgcc aaagcggcct acgatcctac ggggcactac 
gcccgcggtg atgtcgtccg tctcatggtc aaccgcagcc cccgtcgcac gagcgtcagc 
ttcagcgaag acgagaacgc ggcggtcact ttcaccgaga cctga 

<210> 190 
<211> 334 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 

<400> 190 , ^ „ . 

Met Gin Asp Thr Lys Phe Lys Val Ala Val Val Gin Ala Ala Pro Val 

1 5 10 15 

Phe Met Asp Ala Pro Ala Ser Val Ala Lys Ala He Gly Phe He Gin 

20 25 30 

Glu Ala Gly Ala Ala Gly Ala Lys Leu Leu Ala Phe Pro Glu Val Trp 

35 40 45 

He Pro Gly Tyr Pro Trp Trp Leu Trp Leu Gly Thr Pro Ala Trp Gly 

50 55 60 

Met Gin Phe Val Pro Arg Tyr His Ala Asn Ser Leu Arg Ala Asp Gly 
65 70 75 80 

Pro Glu He Leu Ala Leu Cys Ala Ala Ala Ala Glu Ala Lys He Asn 

85 90 95 

Val Val Met Gly Phe Ser Glu He Asp Gly Gly Thr Leu Tyr Leu Ser 

100 105 HO 

Gin Val Phe He Ser Asp Ala Gly Lys He He Phe Lys Arg Arg Lys 

115 120 125 

Leu Lys Pro Thr His Val Glu Arg Thr Leu Phe Gly Glu Gly Asp Gly 

130 135 140 

Ser Asp Phe Arg Val Val Asp Ser Ser Val Gly Arg Leu Gly Ala Leu 
145 150 155 160 

Cvs Cvs Ala Glu His He Gin Pro Leu Ser Lys Tyr Ala Met Tyr Ala 

* 165 170 175 

Met Asn Glu Gin He His Val Ala Ser Trp Pro Ser Phe Thr Leu Tyr 

180 185 190 

Arg Gly Lys Ala Tyr Ala Leu Gly His Glu Val Asn Leu Ala Ala Ser 

195 200 205 

Gin He Tyr Ala Leu Glu Gly Gly Cys Phe Val Leu His Ala Thr Ala 

210 215 220 

He Thr Gly Gin Asp Met Phe Asp Met Leu Cys Asp Thr Pro Glu Arg 
225 230 235 240 

Ala Asp Leu Leu Asn Ala Glu Gly Ala Lys Pro Gly Gly Gly Tyr Ser 

245 250 255 

Met He Phe Gly Pro Asp Gly Gin Pro Met Cys Glu His Leu Pro Gin 

260 265 270 

Asd Lvs Glu Gly He Leu Tyr Ala Gly Val Asp Leu Ser Met He Ala 

F 275 280 285 

He Ala Lys Ala Ala Tyr Asp Pro Thr Gly His Tyr Ala Arg Gly Asp 

290 295 300 

Val Val Arg Leu Met Val Asn Arg Ser Pro Arg Arg Thr Ser Val Ser 
305 310 315 320 

Phe Ser Glu Asp Glu Asn Ala Ala Val Thr Phe Thr Glu Thr 
325 330 



540 
600 
660 
720 
780 
840 
900 
960 
1005 
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<210> 191 
<211> 945 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 191 

atgaaaaagt tagctgtggt tcaacgtgcg tcaggatttt tagataagca gcagagcatc 60 

gcgttggcgg tggaaagtat tcagtctgct gcgaataatg gcgcagagct tgttgttttt 120 

acggaagcct ttattcctgg ttatcctgtc tggttatggc gtctgcgccc tggcaaagac 180 

tgggggacaa cagacagtct ttatcaacgc ttaataagca acgcggttga tttaagctca 24 0 

tcggatttgg atccgattta tgaagcggca aaacgtcatc acgtcacggt tgtatgcggc 300 

attaatgaac gcgactccag cgtcagccga acaacgctat acaacactta catcacggtt 360 

tgtcatgagg gcaatctcat caatgttcat cgaaaactga tgccgaccaa cccagagaga 420 

atggtgtggg gctttggtga tgcgactgga ttaagggtag tagacactcc tgtcggaagg 480 

attggctcac tcgtttgctg ggagaactac atgccgttgg cacgctatgc actttatgct 540 

cagggcgtcg aaatttacat tgcgcctact tatgacagtg gctcggactg gactgaaagc 600 

ttgcgccata tcgccagaga gggcagatgc tacgttgtcg gcagcggtaa cttgttgaga 660 

gccagcgacc tgcctgatga ttttccagaa aaagaaaccc tctatcctga taaagacgag 720 

tggattaacg gcggagactc taccgttatc gctcccggcg gtgaaacatt agttgctccg 780 

ctgcatgcag aggaaggcat actgtattgc gatattgata ctgataaagt ggcggcggct 840 

cggcgttctt tcgacgttgc aggccattac tctcgcccag acatatttac actcaacgta 900 

aatcgagcgc cgcaaacatc tctgcgtatc agggaagccg agtaa 945 

<210> 192 
<211> 314 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> .192 



Met 


Lys 


Lys 


Leu 


Ala 


Val 


Val 


Gin 


Arg Ala 


Ser Gly 


Phe 


Leu 


Asp Lys 


1 






5 








10 








15 


Gin 


Gin 


Ser 


He 


Ala 


Leu 


Ala 


Val 


Glu Ser 


He Gin 


Ser 


Ala 


Ala Asn 








20 










25 






30 




Asn 


Gly 


Ala 


Glu 


Leu 


Val 


Val 


Phe 


Thr Glu 


Ala Phe 


He 


Pro 


Gly Tyr 




35 










40 






45 






Pro 


Val 


Trp 


Leu 


Trp 


Arg 


Leu 


Arg 


Pro Gly 


Lys Asp 


Trp Gly Thr Thr 




50 










55 






60 








Asp 


Ser 


Leu 


Tyr 


Gin 


Arg 


Leu 


lie 


Ser Asn 


Ala Val 


Asp Leu 


Ser Ser 


65 










70 








75 






80 


Ser 


Asp 


Leu 


Asp 


Pro 


He 


Tyr 


Glu 


Ala Ala 


Lys Arg 


His 


His 


Val Thr 








85 








90 








95 


Val 


Val 


Cys 


Gly 


He 


Asn 


Glu 


Arg 


Asp Ser 


Ser Val 


Ser Arg 


Thr Thr 








100 










105 






110 




Leu 


Tyr 


Asn 


Thr 


Tyr 


He 


Thr 


Val 


Cys His 


Glu Gly 


Asn 


Leu 


He Asn 






115 










120 






125 






Val 


His 


Arg 


Lys 


Leu 


Met 


Pro 


Thr 


Asn Pro 


Glu Arg 


Met 


Val 


Trp Gly 




130 










135 






140 








Phe 


Gly 


Asp 


Ala 


Thr 


Gly 


Leu 


Arg 


Val Val 


Asp Thr 


Pro 


Val 


Gly Arg 


145 








150 








155 






160 


He 


Gly 


Ser 


Leu 


Val 


Cys 


Trp 


Glu 


Asn Tyr 


Met Pro 


Leu 


Ala 


Arg Tyr 








165 






170 








175 


Ala 


Leu 


Tyr 


Ala 


Gin 


Gly 


Val 


Glu 


He Tyr 


He Ala 


Pro 


Thr 


Tyr Asp 








180 










185 






190 




Ser 


Gly 


Ser 


Asp 


Trp 


Thr 


Glu 


Ser 


Leu Arg 


His He 


Ala 


Arg 


Glu Gly 
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195 


200 


205 








Tyr Val Val Gly 


Ser Gly Asn Leu Leu Arg 


Ala 


Ser Asp 


Leu 


210 




215 220 








Pro Asp 


Asp Phe Pro Glu 


Lys Glu Thr Leu Tyr Pro 


Asp 


Lys Asp 


Glu 


225 


230 


235 






240 


Trp lie 


Asn Gly Gly Asp 


Ser Thr Val lie Ala Pro 


Gly 


Gly Glu 


Thr 


245 


250 




255 




Leu Val 


Ala Pro Leu His 


Ala Glu Glu Gly lie Leu 


Tyr 


Cys Asp 


lie 




260 


265 




270 




Asp Thr 


Asp Lys Val Ala 


Ala Ala Arg Arg Ser Phe 


Asp 


Val Ala 


Gly 


275 


280 


285 






His Tyr 


Ser Arg Pro Asp 


lie Phe Thr Leu Asn Val 


Asn 


Arg Ala 


Pro 


290 




295 300 








Gin Thr 


Ser Leu Arg lie 


Arg Glu Ala Glu 








305 


310 











<210> 193 
<2U> 966 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 193 

atgtcaaacg agaaccacaa ccaaacattc aaagttgccg cggtgcaggc cacacctgta 60 

ttcctcgatc gtgaagcgac catcgacaaa gcttgtgagt tgattgctgc agccggcaat 120 

gaaggagcgc ggctggttgt cttcccggag gcattcatcc catcctaccc agattgggta 180 

tgggcaatcc caccgggcga agaaggcgtg ctcaatgagt tgtacgcgga actgctctcc 240 

aattcggtca cgattcccag tgatgtgacg gatagactgt gccgagccgc gagacttgcc 300 

aatgcctacg tagtgatggg gatgagcgaa cgcaatgccg aggccagtgg cgcaagcctg 360 

tataacacgc tgttgtacat cgatgcgcag ggcgagattc tgggcaaaca tcgaaagctg 420 

gtgcccacag gcggcgaacg gctggtgtgg gcgcagggcg atggcagcac gctgcaggtc 480 

tacgatactc cactgggtaa actcggcggt ttgatttgct gggagaatta tatgccgctg 540 

gcccgctaca ccatgtacgc atggggcaca caaatctatg ttgcggcgac atgggatcgc 600 

gggcaaccct ggctctccac tttacggcat atcgccaaag aaggcagggt gtacgtgatc 660 

ggctgctgta tcgtgatgcg caaagacgat atcccagatc gttacccgat gaagcagaag 720 

ttttacgcgg aggccgatga gtggatcaac ataggggaca gcgcaatcgt caatcctgaa 780 

gggcagttta gcgccgggcc ggtacgcaaa caggaagaga ttctctacgc ggaaattgat 840 

ccgcgcatgg tgcaaggccc gaagtggatg ctcgacgtag cagggcacta cgcgaggccg 900 

gacgtattcc agttgacggt gcatacggat gcgaggcaga tgatcaggtt ggaacacgat 960 

gtttaa 966 

<210> 194 

<211> 321 

<212> PRT 

<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 194 

Met Ser Asn Glu Asn His Asn Gin Thr Phe Lys Val Ala Ala Val Gin 

15 10 15 

Ala Thr Pro Val Phe Leu Asp Arg Glu Ala Thr lie Asp Lys Ala Cys 

20 25 30 

Glu Leu He Ala Ala Ala Gly Asn Glu Gly Ala Arg Leu Val Val Phe 

35 40 45 

Pro Glu Ala Phe He Pro Ser Tyr Pro Asp Trp Val Trp Ala He Pro 

50 55 60 

Pro Gly Glu Glu Gly Val Leu Asn Glu Leu Tyr Ala Glu Leu Leu Ser 
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65 70 75 80 

Asn Ser Val Thr lie Pro Ser Asp Val Thr Asp Arg Leu Cys Arg Ala 

85 90 95 

Ala Arg Leu Ala Asn Ala Tyr Val Val Met Gly Met Ser Glu Arg Asn 

100 105 110 

Ala Glu Ala Ser Gly Ala Ser Leu Tyr Asn Thr Leu Leu Tyr He Asp 

115 120 125 

Ala Gin Gly Glu He Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly 

130 135 140 

Gly Glu Arg Leu Val Trp Ala Gin Gly Asp Gly Ser Thr Leu Gin Val 
145 150 155 160 

Tyr Asp Thr Pro Leu Gly Lys Leu Gly Gly Leu He Cys Trp Glu Asn 

165 170 175 

Tyr Met Pro Leu Ala Arg Tyr Thr Met Tyr Ala Trp Gly Thr Gin He 

180 185 190 

Tyr Val Ala Ala Thr Trp Asp Arg Gly Gin Pro Trp Leu Ser Thr Leu 

195 200 205 

Arg His He Ala Lys Glu Gly Arg Val Tyr Val He Gly Cys Cys He 

210 215 220 

Val Met Arg Lys Asp Asp He Pro Asp Arg Tyr Pro Met Lys Gin Lys 
225 230 235 240 

Phe Tyr Ala Glu Ala Asp Glu Trp He Asn He Gly Asp Ser Ala He 

245 250 255 

Val Asn Pro Glu Gly Gin Phe Ser Ala Gly Pro Val Arg Lys Gin Glu 

260 265 270 

Glu He Leu Tyr Ala Glu He Asp Pro Arg Met Val Gin Gly Pro Lys 

275 280 285 

Trp Met Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe Gin 

290 295 300 

Leu Thr Val His Thr Asp Ala Arg Gin Met He Arg Leu Glu His Asp 
305 310 315 320 

Val 



<210> 195 
<211> 993 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 195 

atgaaagtcg tcaaagccgc cgcggtacag ttcagcccgg tgctctatag ccgtgaggca 60 

accgtcgcca aggtcgtgca gaagatccac gaactcggcc agaaaggcgt gcagttcgcc 120 

accttccccg aaacggtcgt gccttattac ccttactttg cggccgtcca gacgggcatc 180 

gagcttctct cgggcaccga acatctgcgc ctgctcgaac aggctgtgac tgtcccgtcc 240 

gctgctaccg acgcgatcgg cgaagccgcg cgaaaggccg gcatggtcgt gtccattggc 300 

gtcaatgagc gcgatggcgg cacgctgtac aacgcacaac tgctcttcga tgccgacggt 360 

acgctgatcc agcgccgccg caagatcacg ccgacgcatt tcgaacgcat gatctggggc 420 

cagggagatg gctcgggctt gcgtgcagtc gacagcgccg tcggccgcgt cggccagctc 480 

gcatgtttcg agcacaacaa cccgctcgcc cgctacgcaa tgatcgccga cggcgagcag 540 

atccattcgg cgatgtaccc tggctcggcc tttggcgagg gcttcgccca gcgtatggaa 600 

atcaacatcc gccagcatgc gctcgagtcc gccgctttcg tcgtcaacgc aacagcgtgg 660 

ctggacgccg accagcaggc gcaaatcatg aaggacaccg gttgtggaat cggtccgatc 720 

acgggcggct gcttcaccac gatcgtctct cctgacggca tgctgatggc cgagccgctt 780 

cgctcgggtg aaggcgaagt gatcgtcgat ctcgacttca cgcagatcga ccgccgcaag 840 

atgctgatgg actcggccgg ccactacaac cgccctgaac tgctgagtct gatgatcgac 900 

cgtacgccga ccgcgcatgt tcacgaacgc gcttcgcacc cgatgatcgt caacgaccag 960 
ggttccgacg atctgcgcac ccaggctgca tga 
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<210> 196 
<211> 330 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 196 

Met Lys Val Val Lys Ala Ala Ala Val Gin Phe Ser Pro Val Leu Tyr 

15 10 15 

Ser Arg Glu Ala Thr Val Ala Lys Val Val Gin Lys lie His Glu Leu 

20 25 30 

Gly Gin Lys Gly Val Gin Phe Ala Thr Phe Pro Glu Thr Val Val Pro 

35 40 45 

Tyr Tyr Pro Tyr Phe Ala Ala Val Gin Thr Gly lie Glu Leu Leu Ser 

50 55 60 

Gly Thr Glu His Leu Arg Leu Leu Glu Gin Ala Val Thr Val Pro Ser 
65 70 75 80 

Ala Ala Thr Asp Ala lie Gly Glu Ala Ala Arg Lys Ala Gly Met Val 

85 90 95 

Val Ser lie Gly Val Asn Glu Arg Asp Gly Gly Thr Leu Tyr Asn Ala 

100 105 110 

Gin Leu Leu Phe Asp Ala Asp Gly Thr Leu lie Gin Arg Arg Arg Lys 

115 120 125 

He Thr Pro Thr His Phe Glu Arg Met He Trp Gly Gin Gly Asp Gly 

130 135 140 

Ser Gly Leu Arg Ala Val Asp Ser Ala Val Gly Arg Val Gly Gin Leu 
145 150 155 . 160 

Ala Cys Phe Glu His Asn Asn Pro Leu Ala Arg Tyr Ala Met He Ala 

165 170 175 

Asp Gly Glu Gin He His Ser Ala Met Tyr Pro Gly Ser Ala Phe Gly 

180 185 190 

Glu Gly Phe Ala Gin Arg Met Glu He Asn He Arg Gin His Ala Leu 

195 200 205 

Glu Ser Ala Ala Phe Val Val Asn Ala Thr Ala Trp Leu Asp Ala Asp 

210 215 220 

Gin Gin Ala Gin He Met Lys Asp Thr Gly Cys Gly He Gly Pro He 
225 230 235 240 

Thr Gly Gly Cys Phe Thr Thr He Val Ser Pro Asp Gly Met Leu Met 

245 250 255 

Ala Glu Pro Leu Arg Ser Gly Glu Gly Glu Val He Val Asp Leu Asp 

260 265 270 

Phe Thr Gin He Asp Arg Arg Lys Met Leu Met Asp Ser Ala Gly His 

275 280 285 

Tyr Asn Arg Pro Glu Leu Leu Ser Leu Met He Asp Arg Thr Pro Thr 

290 295 300 

Ala His Val His Glu Arg Ala Ser His Pro Met He Val Asn Asp Gin 
305 310 315 320 

Gly Ser Asp Asp Leu Arg Thr Gin Ala Ala 
325 330 

<210> 197 
<211> 1017 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 197 
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atgggcatcg 
gacctcgatc 
gccagattga 
gactcgccgg 
gcttatgata 
gccgtgatcg 
ggccccgacg 
acggtctatg 
cggctcggcg 
gcccagaacg 
gccccggcgc 
tcctgcttcg 
gaccggcccg 
cccgacggca 
gacatcgatc 
tcgcggcccg 
tttgctttgc 



aacatccaaa 
gctcgatcga 
tcgcattccc 
cctgggcgat 
gtccgcaggc 
gcctttcgga 
gcgagaccat 
gcgagggcga 
cgctgtgctg 
agcaggtcca 
tcggcgcgga 
tgctggcgcc 
acaagcacgc 
gttcgctggc 
tcggcgcgat 
atgtcacgcg 
ccggcgatac 



ataccgcgtt 
caaggctatt 
ggaggtcttc 
cggccgcggc 
cgagcggctc 
gcggagcggc 
cgcgaagcgc 
cggcagcgat 
ctgggaacat 
tgtggcgtca 
ggtgaacaat 
gtgcgcgacc 
gctgctccat 
cgaaaagctc 
cggcgtcgcg 
gctgctgctg 
cgtggcgcct 



gccgccgtgc 
gcactgatcg 
atccccggct 
ttcgtgcagc 
cgcgcagcgg 
ggcagcctct 
cgcaagctcc 
ctggcggtcc 
ctgcaaccgt 
tggccgagtt 
gcggcctccc 
gtctcgcagg 
gccggcggtg 
gcgccggacc 
aagaacgccg 
aacaacaagc 
gccgatgtgg 



aggctgcgcc 
aggaggccgc 
acccctggca 
gttatttcga 
tccgcaaggc 
acatcgcgca 
gtccgacgca 
atgaccggcc 
tgtcgaaata 
tttcgctcta 
gggtctatgc 
ccatgatcga 
gctttgccgc 
aggagggcct 
ccgacccggc 
cctacaagcg 
atgcggcggc 



ggcctggctc 
cgcgaacggc 
tatctggctc 
caattcgctc 
gcgcctgacc 
atggctcgtt 
tgccgagcgc 
cgatatcgga 
tgcgatgtat 
cgatcccttt 
ggtcgaaggc 
tgagctgtgc 
gatctacggg 
gctttacgcc 
agggcattat 
cgtggagcat 
gagctga 



<210> 198 
<211> 338 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1017 



<400> 198 

Met Gly lie Glu His Pro Lys Tyr Arg Val Ala 

15 10 
Pro Ala Trp Leu Asp Leu Asp Arg Ser He Asp 

20 25 
He Glu Glu Ala Ala Ala Asn Gly Ala Arg Leu 

35 40 
Val Phe lie Pro Gly Tyr Pro Trp His He Trp 

50 55 
Trp Ala He Gly Arg Gly Phe Val Gin Arg Tyr 
65 70 75 

Ala Tyr Asp Ser Pro Gin Ala Glu Arg Leu Arg 

85 90 
Ala Arg Leu Thr Ala Val He Gly Leu Ser Glu 

100 105 
Leu Tyr He Ala Gin Trp Leu Val Gly Pro Asp 

115 120 
Lys Arg Arg Lys Leu Arg Pro Thr His Ala Glu 

130 135 
Glu Gly Asp Gly Ser Asp Leu Ala Val His Asp 
145 150 155 

Arg Leu Gly Ala Leu Cys Cys Trp Glu His Leu 

165 170 
Tyr Ala Met Tyr Ala Gin Asn Glu Gin Val His 

180 185 
Ser Phe Ser Leu Tyr Asp Pro Phe Ala Pro Ala 

195 200 
Asn Asn Ala Ala Ser Arg Val Tyr Ala Val Glu 

210 215 
Leu Ala Pro Cys Ala Thr Val Ser Gin Ala Met 
225 230 235 

Asp Arg Pro Asp Lys His Ala Leu Leu His Ala 

245 250 
Ala He Tyr Gly Pro Asp Gly Ser Ser Leu Ala 
260 265 



Ala Val Gin Ala Ala 
15 

Lys Ala He Ala Leu 
30 

He Ala Phe Pro Glu 
45 

Leu Asp Ser Pro Ala 
60 

Phe A3p Asn Ser Leu 
80 

Ala Ala Val Arg Lys 
95 

Arg Ser Gly Gly Ser 
110 

Gly Glu Thr He Ala 
125 

Arg Thr Val Tyr Gly 
140 

Arg Pro Asp He Gly 
160 

Gin Pro Leu Ser Lys 
175 

Val Ala Ser Trp Pro 
190 

Leu Gly Ala Glu Val 
205 

Gly Ser Cys Phe Val 
220 

He Asp Glu Leu Cys 
240 

Gly Gly Gly Phe Ala 
255 

Glu Lys Leu Ala Pro 
270 
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Asp Gin Glu Gly Leu Leu Tyr Ala 
275 280 
Val Ala Lys Asn Ala Ala Asp Pro 

290 295 
Val Thr Arg Leu Leu Leu Asn Asn 
305 310 
Phe Ala Leu Pro Gly Asp Thr Val 
325 

Ala Ser 



Asp lie Asp Leu Gly Ala He Gly 
285 

Ala Gly His Tyr Ser Arg Pro Asp 
300 

Lys Pro Tyr Lys Arg Val Glu His 
315 320 
Ala Pro Ala Asp Val Asp Ala Ala 
330 335 



<210> 199 
<211> 993 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 199 

atgtcaaacg cactgaagcc gttcaaaatc gccgccgttc aggcgacgcc ggtttttctg 60 

aaccgggaag ccacggcgga gaaggccgcg gctttgatcc gcgaagcggg aagcgccgga 120 

gccaagctca tcgttttccc ggaatcgttt attccggcct atccggactg ggtctgggtg 180 

gtcccctcgg ggagggatcg ccttctcagc ggcctctacg gggagatgct cgaaaacgcc 240 

gtggaaatcc ccggcccggc cacggggcat atcggccggg cggcgaagga atcgggcgct 300 

tatgtcgtca tgggcgtgac cgagcgggac acggaggcga gcggagccag tttgttcaac 360 

accttgattt atttcggtcc gaccggggaa attttgggca aacaccggaa gctggttccc 420 

accgggggcg aacggatcgt ctgggcccag ggggacggaa gcaccctgga ggtctacgat 480 

acgcccctgg gaaaactggg cgggctgatc tgctgggaaa actacatgcc cctggcccgg 540 

tacgccatgt acgcctgggg aacccagctt tacgtggccg ccacctggga ccgaggcgaa 600 

ccctggcttt cgacgcttcg gcatatcgcc aaggaagggc gggtgtatgt catcgggtgc 660 

tgcatcgcca tgcggaaagg ggatatcccg gatcggttcg aacacaaggg gctctacgcc 720 

cccgaccggg actggatcaa ccccggcgac agcgcgatcg tcaaccccca gggggagatg 780 

atcgccgggc ccgcttccaa taaggaagag atcctttatg cggaagtcga cccgcagatg 840 

atgcgcgggc ccaaatggat gctcgatgtg gccggccatt acgcgcggcc cgatgtcttc 900 

gagctcaccg tccgccggga accgcggccg atgatccgcg tggcgggagg cgcgggcggg 960 

accgaaccca aagagaagaa gaccgccggc tga 993 

<210> 200 
<211> 330 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 200 



Met 


Ser 


Asn 


Ala 


Leu 


Lys 


Pro 


Phe 


Lys 


He 


Ala 


Ala Val 


Gin 


Ala 


Thr 


1 








5 








10 








15 




Pro 


Val 


Phe 


Leu 


Asn 


Arg 


Glu 


Ala 


Thr 


Ala 


Glu Lys Ala 


Ala 


Ala 


Leu 








20 










25 








30 






He 


Arg 


Glu 


Ala 


Gly 


Ser 


Ala 


Gly 


Ala 


Lys 


Leu 


He Val 


Phe 


Pro 


Glu 




35 










40 








45 








Ser 


Phe 
50 


He 


Pro 


Ala 


Tyr 


Pro 
55 


Asp 


Trp 


Val 


Trp 


Val Val 
60 


Pro 


Ser 


Gly 


Arg 


Asp 


Arg 


Leu 


Leu 


Ser 


Gly 


Leu 


Tyr 


Gly 


Glu 


Met Leu 


Glu 


Asn 


Ala 


65 








70 










75 








80 


Val 


Glu 


He 


Pro 


Gly 
85 


Pro 


Ala 


Thr 


Gly 


His 
90 


He 


Gly Arg 


Ala 


Ala 
95 


Lys 


Glu 


Ser 


Gly 


Ala 


Tyr 


Val 


Val 


Met 


Gly 


Val 


Thr 


Glu Arg 


Asp 


Thr 


Glu 






100 








105 








110 
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Ala Ser 


Gly 


Ala Ser Leu 


fne Asn inr i>eu 


He Tyr 


true uiy 


Pro 


Thr 












125 






Gly Glu 


lie 


Leu Gly Lys 


His Arg Lys Leu 


vai. fro 


inr vaxy 


r-*\ \r 


Glu 










140 








Arg lie 


val 


l rp /via uin 


uxy Asp uj.y Dei 


Thr Leu 


O "1 w- 1 
ulU vol 


Tyr 




1 A ti 








ijj 






160 


Thr Pro 


Leu 


Gly Lys Leu 


rl rl„ Tail T 1 Q 

uiy vjjiy iieu j. j.e 


Cys Trp 




Tyr 


rlel 






ICC 

loo 


X /U 






X f D 




Pro Leu 


Ala 


Arg Tyr Ala 


Met; Tyr Aia lrp 


uiy inr 


Gin Leu 


Tyr 


vai 






180 


lOD 




i on 






Ala Ala 


Thr 


Trp Asp Arg 


Gly Glu Pro Trp 


Leu Ser 


Thr Leu 


Arg 


his 




i tic 
195 




O A A 




9 ACL 






lie Ala 


Lys 


Glu Gly Arg 


Val Tyr val lie 


Gly Cys 


Cys He 


Ala 


Met 


210 
















Arg Lys 


Gly 


Asp lie Pro 


Asp Arg Phe Glu 


His Lys 


Gly Leu 


Tyr 


Ala 


225 




230 




o *a d 






240 


Pro Asp 


Arg 


Asp Trp lie 


Asn Pro Gly Asp 


Ser Ala 


He Val 


Asn 


Pro 






245 


250 






255 




Gin Gly 


Glu 


Met lie Ala 


Gly Pro Ala Ser 


Asn Lys 


Glu Glu 


He 


Leu 




260 


265 




270 






Tyr Ala 


Glu 


Val Asp Pro 


Gin Met Met Arg 


Gly Pro 


Lys Trp 


Met 


Leu 


275 




280 




285 






Asp Val 


Ala 


Gly His Tyr 


Ala Arg Pro Asp 


Val Phe 


Glu Leu 


Thr 


Val 


290 






295 


300 








Arg Arg 


Glu 


Pro Arg Pro 


Met lie Arg Val 


Ala Gly 


Gly Ala 


Gly Gly 


305 




310 




315 






320 


Thr Glu 


Pro 


Lys Glu Lys 


Lys Thr Ala Gly 














325 


330 











<210> 201 
<211> 930 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 201 

atgaccacga agagcatccg catcgcggcc gtacaggcgg ctcccgcgtt tctcgacctg 60 

gctggtacgc tggaccggct cgaggcctgg gcccgcaagg ccgccgccac cggtgcccgc 120 

gtcatcgcgt tccccgagac ctggctgccg ggctacccgg cgtggatcga ctcgtcgccg 180 

gaggccgcga tctggggcca tcccggctcg cgcgacctgc accagcgcct gatggagaat 240 

gccgtcgagg tcccgggccc cgcgaccgcg cgcatcgcga agctcgccgg cgagctcggc 300 

gtgacgatcg tggtcggcgc gcacgagcgg gcggggaaca ccctctacaa cacggcgctg 360 

acgttcgggc ccgagggcag gctgctcaat caccaccgga agctggtgcc gacctacagc 420 

gaacggctgc tgtggggcta cggcgacggc gctggactgg tggcgccggc ggtggacggt 4 80 

gtgaaggtcg gggcgctggt gtgctgggag cactggatgc cgctcacccg ccaggcgatg 540 

cacgacgtcg gcgagcacgt gcacgtcgcc ctgtggcccg gcgtccacga gatgcaccag 600 

gtggcctcgc ggcactatgc gttcgagggc cgctgtttcg tgatcgcggt cgggagcatc 660 

ctgcgcgtgg accagatgcc gaagcagctg ccgccgctgg agaagtacgc gaagagcgcc 720 

aaggggctga tgatcgcggg cggcagcgcc atcatcgcgc cgaacggccg ctacgtcgcg 780 

gcgccggtgt acgacgagga gacgatcgtc accgccgact gcgacctcgg cgagatcccg 840 

cgcgaggcgc agacgctcga tgtctcgggc cactacagcc ggccggacgt gttcagcttc 900 

ggggtggtca gacaccggcc gcgtgcgtaa 930 

<210> 202 

<211> 309 

<212> PRT 

<213> Unknown 

<220> 
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<223> Obtained from an environmental sample 
<400> 202 

Met Thr Thr Lys Ser lie Arg lie Ala Ala Val Gin Ala Ala Pro Ala 

15 10 15 

Phe Leu Asp Leu Ala Gly Thr Leu Asp Arg Leu Glu Ala Trp Ala Arg 

20 25 30 

Lys Ala Ala Ala Thr Gly Ala Arg Val He Ala Phe Pro Glu Thr Trp 

35 40 45 

Leu Pro Gly Tyr Pro Ala Trp He Asp Ser Ser Pro Glu Ala Ala He 

50 55 60 

Trp Gly His Pro Gly Ser Arg Asp Leu His Gin Arg Leu Met Glu Asn 
65 70 75 80 

Ala Val Glu Val Pro Gly Pro Ala Thr Ala Arg He Ala Lys Leu Ala 

85 90 95 

Gly Glu Leu Gly Val Thr He Val Val Gly Ala His Glu Arg Ala Gly 

100 105 110 

Asn Thr Leu Tyr Asn Thr Ala Leu Thr Phe Gly Pro Glu Gly Arg Leu 

115 120 125 

Leu Asn His His Arg Lys Leu Val Pro Thr Tyr Ser Glu Arg Leu Leu 

130 135 140 

Trp Gly Tyr Gly Asp Gly Ala Gly Leu Val Ala Pro Ala Val Asp Gly 
145 150 155 160 

Val Lys Val Gly Ala Leu Val Cys Trp Glu His Trp Met Pro Leu Thr 

165 170 175 

Arg Gin Ala Met His Asp Val Gly Glu His Val His Val Ala Leu Trp 

180 185 190 

Pro Gly Val His Glu Met His Gin Val Ala Ser Arg His Tyr Ala Phe 

195 200 205 

Glu Gly Arg Cys Phe Val He Ala Val Gly Ser He Leu Arg Val Asp 

210 215 220 

Gin Met Pro Lys Gin Leu Pro Pro Leu Glu Lys Tyr Ala Lys Ser Ala 
225 230 235 240 

Lys Gly Leu Met He Ala Gly Gly Ser Ala He He Ala Pro Asn Gly 

245 250 255 

Arg Tyr Val Ala Ala Pro Val Tyr Asp Glu Glu Thr He Val Thr Ala 

260 265 270 

Asp Cys Asp Leu Gly Glu He Pro Arg Glu Ala Gin Thr Leu Asp Val 

275 280 285 

Ser Gly His Tyr Ser Arg Pro Asp Val Phe Ser Phe Gly Val Val Arg 

290 295 300 

His Arg Pro Arg Ala 
305 

<210> 203 
<211> 966 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 203 

atgtcaagct tacccacatc cgcgttcacc gtcgccgccg cgcaagcgtc gccagtgttc 60 

ctcgaccgcg acgcgacgct gcagaaggct tgcgggctga tcgccgacgc cgggcgcgcg 120 

ggcgcgcgcc tgatcgtctt ccccgaagcc ttcatcccgg cctaccccga ttgggtgtgg 180 

gcggtcccag ctggcgaaga ggggatgctg agcgagctct acgccgagct ggtcgcgaat 240 

tcgctggcta ttccgagcga cgcgaccgat cggctatgtc gcgcggcgca ggccgcgcat 300 

atcaatgtgg tcgtggggtt gagcgagcgc aatgtcgagg ccagcggcgc cagcctctac 360 

aacacgctgc tgtatatcga cgcggcggga acgatcctgg gtaaacaccg caagcttgtg 420 

ccgaccggcg gggagcgcct ggtctgggcg cagggcgacg gcagcacgct cgatgtgtac 480 
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gacaccgcgc tcggcaagct cggcggcctg atctgttggg aaaactacat gccgctggca 540 

cgctacgcgc tgtacgcctg gggtgtgcaa atctatgtcg cggccacctg ggatcgcggc 600 

gagccctggc tttctactct gcgacatatc gccaaggaag gccgtgtcta cgtgatcggc 660 

tgtggcatgg cgctgcgcag agatgatatt cccgatcgct tcgctttcaa gcagcgcttc 720 

tatgcccagg ccggcgaatg gatcaacgtc ggcgacagcg cgatcgtcaa cccgagcggc 780 

gagtttattg ccggacctgt gcgcgaacgc gaggagattc tgtacgcgga ggtcgacccg 840 

gagcagatga gcgggccaaa gtggatgctc gacgtggccg ggcactacgg gcggccggat 900 

gtcttccggc tcagcgtcaa ccgggcgccg caccagatga tccagacgga gaaccgggag 960 

acctga 966 

<210> 204 
<211> 321 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 204 



Met 


Ser 


Ser 


Leu 


Pro 


Thr 


Ser 


Ala 


Phe Thr 


Val Ala Ala Ala Gin Ala 


1 








5 








10 


15 


Ser 


Pro 


Val 


Phe 


Leu 


Asp 


Arg 


Asp 


Ala Thr 


Leu Gin Lys Ala Cys Gly 








20 










25 


30 


Leu 


lie 


Ala 


Asp 


Ala 


Gly 


Arg 


Ala 


Gly Ala 


Arg Leu He Val Phe Pro 






35 










40 




45 


Glu 


Ala 


Phe 


He 


Pro 


Ala 


Tyr 


Pro 


Asp Trp 


Val Trp Ala Val Pro Ala 




50 










55 






60 


Gly 


Glu 


Glu 


Gly 


Met 


Leu 


Ser 


Glu 


Leu Tyr 


Ala Glu Leu Val Ala Asn 


65 








70 








75 80 


Ser 


Leu 


Ala 


He 


Pro 


Ser 


Asp 


Ala 


Thr Asp 


Arg Leu Cys Arg Ala Ala 










85 








90 


95 


Gin 


Ala 


Ala 


His 


He 


Asn 


Val 


Val 


Val Gly 


Leu Ser Glu Arg Asn Val 








100 










105 


110 


Glu 


Ala 


Ser 


Gly 


Ala 


Ser 


Leu 


Tyr 


Asn Thr 


Leu Leu Tyr He Asp Ala 






115 










120 




125 


Ala 


Gly 


Thr 


He 


Leu 


Gly 


Lys 


His 


Arg Lys 


Leu Val Pro Thr Gly Gly 




130 










135 






140 


Glu 


Arg 


Leu 


Val 


Trp 


Ala 


Gin 


Gly 


Asp Gly 


Ser Thr Leu Asp Val Tyr 


145 










150 








155 160 


Asp 


Thr 


Ala 


Leu 


Gly 


Lys 


Leu 


Gly 


Gly Leu 


He Cys Trp Glu Asn Tyr 










165 








170 


175 


Met 


Pro 


Leu 


Ala 


Arg 


Tyr 


Ala 


Leu 


Tyr Ala 


Trp Gly Val Gin He Tyr 








180 










185 


190 


Val 


Ala 


Ala 


Thr 


Trp 


Asp 


Arg 


Gly 


Glu Pro 


Trp Leu Ser Thr Leu Arg 






195 










200 




205 


His 


He 


Ala 


Lys 


Glu 


Gly 


Arg 


Val 


Tyr Val 


He Gly Cys Gly Met Ala 




210 










215 






220 


Leu 


Arg 


Arg 


Asp 


Asp 


He 


Pro 


Asp 


Arg Phe 


Ala Phe Lys Gin Arg Phe 


225 










230 








235 240 


Xyr 


Ala 


Gin 


Ala 


Gly 


Glu 


Trp 


He 


Asn Val 


Gly Asp Ser Ala He Val 








245 








250 


255 


Asn 


Pro 


Ser 


Gly 


Glu 


Phe 


He 


Ala 


Gly Pro 


Val Arg Glu Arg Glu Glu 








260 










265 


270 


He 


Leu 


Tyr 


Ala 


Glu 


Val 


Asp 


Pro 


Glu Gin 


Met Ser Gly Pro Lys Trp 






275 










280 




285 


Met 


Leu 


Asp 


Val 


Ala 


Gly 


His 


Tyr 


Gly Arg 


Pro Asp Val Phe Arg Leu 




290 










295 






300 


Ser 


Val 


Asn 


Arg 


Ala 


Pro 


His 


Gin 


Met He 


Gin Thr Glu Asn Arg Glu 



305. 310 315 320 

Thr 
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<210> 205 
<211> 969 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 205 

atgaccaccg taaaagccgc cgcagtacag atcagccccg tgctctatag ccgggaggcc 60 

accgtagaca aggtcgttcg caagatccgc gagctcggcc aaaagggagt gcagttcgcc 120 

accttcccgg aaaccgtagt gccgtactac ccctacttcg ctgcagtcca gacaggcatc 180 

gaactgttgt ccggcaagga acacatgcgc ctgctggagc aggccgttac cgtcccctcg 240 

cccgccacgg atgcgattgc tcaggcggcg cgcgaagcca atatggtggt gtccatcggc 300 

gtcaacgagc gcgacggcgg caccatctac aacacgcagc tgctcttcga tgccgacggc 360 

acgctcgtgc agcgccgccg caagataacg ccaacgcact tcgagcgcat ggtctggggc 420 

cagggcgatg gttcgggatt gcgcgccgcg gacaccaagg ttggccgcat cggccagttg 480 

gcctgcttcg agcacaacaa cccgctcgcc cgttacgcca tgatggccga tggcgagcag 540 

atccactccg ccatgtaccc gggctcggcc ttcggcgagg gcttcgcgca gcgcatggag 600 

atcaacatcc gccagcatgc cctggagtct ggctgcttcg tggtgaatgc gaccgcctgg 660 

ctcgatgccg accaacaggc gcagatcatg aaggacaccg gttgctcgat cggcccgatc 720 

tccggcggct gcttcacgac catcgtcacg cctgagggca tgctgattgg cgagccgctc 780 

cgcgagggcg aaggcgaaat catcgccgac ctcgatttct cgatgatcga tcgccgcaag 840 

ctgctgatgg actcggtcgg tcactacaac cgtccggagc ttctgagcct cctgatcgat 900 

cgcacgcctg ccgcgaactt ccatgaacgt accgcgagcc aggcgaacgc cggcgtcgaa 960 

atcctctga 969 

<210> 206 
<211> 322 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 206 



Met 


Thr 


Thr 


Val 


Lys 


Ala 


Ala 


Ala 


Val 


Gin 


He Ser Pro Val Leu Tyr 


1 








5 










10 


15 


Ser 


Arg 


Glu 


Ala 


Thr 


Val 


Asp 


Lys 


Val 


Val 


Arg Lys He Arg Glu Leu 






20 










25 




30 


Gly 


Gin 


Lys 


Gly 


Val 


Gin 


Phe 


Ala 


Thr 


Phe 


Pro Glu Thr Val Val Pro 




35 










40 






45 


Tyr 


Tyr 


Pro 


Tyr 


Phe 


Ala 


Ala 


Val 


Gin 


Thr 


Gly He Glu Leu Leu Ser 


50 










55 








60 


Gly 


Lys 


Glu 


His 


Met 


Arg 


Leu 


Leu 


Glu 


Gin 


Ala Val Thr Val Pro Ser 


65 








70 










75 80 


Pro 


Ala 


Thr 


Asp 


Ala 


He 


Ala 


Gin 


Ala 


Ala 


Arg Glu Ala Asn Met Val 








85 










90 


95 


Val 


Ser 


lie 


Gly 


Val 


Asn 


Glu 


Arg 


Asp Gly 


Gly Thr He Tyr Asn Thr 








100 










105 




110 


Gin 


Leu 


Leu 


Phe 


Asp 


Ala 


Asp 


Gly 


Thr 


Leu 


Val Gin Arg Arg Arg Lys 






115 










120 






125 


lie 


Thr 


Pro 


Thr 


His 


Phe 


Glu 


Arg 


Met 


Val 


Trp Gly Gin Gly Asp Gly 




130 










135 








140 


Ser 


Gly 


Leu 


Arg 


Ala 


Ala 


Asp 


Thr 


Lys 


Val 


Gly Arg He Gly Gin Leu 


145 








150 










155 160 


Ala 


Cys 


Phe 


Glu 


His 


Asn 


Asn 


Pro 


Leu 


Ala 


Arg Tyr Ala Met Met Ala 








165 










170 


175 


Asp 


Gly 


Glu 


Gin 


lie 


His 


Ser 


Ala 


Met 


Tyr 


Pro Gly Ser Ala Phe Gly 




180 










185 




190 
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Glu Gly 


Phe 


Ala 


Gin Arg 


Met 


Glu 




195 








200 


Glu Ser 


Gly Cys 


Phe Val 


Val 


Asn 


210 








215 




Gin Gin 


Ala 


Gin 


He Met 


Lys 


Asp 


225 






230 






Ser Gly 


Gly Cys 


Phe Thr 


Thr 


He 








245 






Gly Glu 


Pro 


Leu 


Arg Glu 


Gly 


Glu 






260 








Phe Ser 


Met 


He 


Asp Arg 


Arg 


Lys 




275 








280 


Tyr Asn 


Arg 


Pro 


Glu Leu 


Leu 


Ser 


290 








295 




Ala Asn 


Phe 


His 


Glu Arg 


Thr 


Ala 


305 






310 







He Leu 



He 


Asn 


He Arg 


Gin 


His Ala 


Leu 










205 






Ala 


Thr 


Ala 


Trp 
220 


Leu 


Asp Ala 


Asp 


Thr 


Gly 


Cys 
235 


Ser 


He 


Gly Pro 


He 
240 


Val 


Thr 
250 


Pro 


Glu 


Gly 


Met Leu 
255 


He 


Gly 


Glu 


He 


He 


Ala 


Asp Leu 


Asp 


265 










270 




Leu 


Leu 


Met 


Asp 


Ser 


Val Gly His 










285 






Leu 


Leu 


He 


Asp Arg 


Thr Pro 


Ala 








300 








Ser 


Gin 


Ala 
315 


Asn 


Ala 


Gly Val 


Glu 
320 



<210> 207 
<211> 966 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 207 

atgtcaaacg agaacaacaa cgctacattc aaagttgccg cagtacaggc tacacctgtt 60 

tttctcgatc gtgaagcgac tctcgacaag gcttgcgatt tgatcgccgc cgccggaggt 120 

gaaggggcac gattggttgt ctttccagaa gccttcatac cggcctatcc ggattgggta 180 

tgggcaatcc caccgggtga agagggcgta cttaatgagt tgtacgcaga gctgctctcc 240 

aactcggtca cgattcccag tgacgcgacg gacagactgt gccgggccgc gaggcttgct 300 

aatgcttacg tggtgatggg gataagcgaa cgcaatgtcg aggcgagtgg agcaagcctg 360 

tataacacgc tgttgtacat cgatgcgcag ggtgagattc taggcaaaca tcgaaagcta 420 

gtgccaacgg gcggcgagcg gctggtgtgg gcgcagggcg atggcagcac actgcaggtc 480 

tacgatactc cactgggaaa actcggcggt ttaatttgct gggagaatta tatgccgctg 540 

gcccgctata ccatgtatgc ctggggcaca caaatctatg tcgccgctac gtgggatcgc 600 

gggcaaccct ggctctccac tttgcggcat atcgccaaag aaggcagggt gtacgtgatt 660 

ggttgttgta tcgcgatgcg caaagacgat atccctgatc gttacgcaat gaagcagaag 720 

ttttacgcgg aggcagatga gtggatcaat ataggtgaca gcgcgattgt caatcctgaa 780 

gggcaattta tcgcagggcc agtacgcaag caggaagaga ttctctacgc agagattgat 84 0 

ccgcgcatgg tacaagggcc gaagtggatg ctcgacgtgg cggggcacta tgccaggccg 900 

gatgtgttcc agttgacggt gcatacggat gtgcgacaga tgattcggat ggaacacgat 960 

tcttaa 966 

<210> 208 
<211> 321 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 208 

Met Ser Asn Glu Asn Asn Asn Ala Thr Phe Lys Val Ala Ala Val Gin 

15 10 15 

Ala Thr Pro Val Phe Leu Asp Arg Glu Ala Thr Leu Asp Lys Ala Cys 

20 25 30 

Asp Leu He Ala Ala Ala Gly Gly Glu Gly Ala Arg Leu Val Val Phe 
35 40 45 
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Pro Glu Ala Phe lie Pro Ala Tyr Pro Asp Trp Val Trp Ala lie Pro 

50 55 60 

Pro Gly Glu Glu Gly Val Leu Asn Glu Leu Tyr Ala Glu Leu Leu Ser 
65 70 75 80 

Asn Ser Val Thr lie Pro Ser Asp Ala Thr Asp Arg Leu Cys Arg Ala 

85 90 95 

Ala Arg Leu Ala Asn Ala Tyr Val Val Met Gly lie Ser Glu Arg Asn 

100 105 110 

Val Glu Ala Ser Gly Ala Ser Leu Tyr Asn Thr Leu Leu Tyr He Asp 

115 120 125 

Ala Gin Gly Glu lie Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly 

130 135 140 

Gly Glu Arg Leu Val Trp Ala Gin Gly Asp Gly Ser Thr Leu Gin Val 
145 150 155 160 

Tyr Asp Thr Pro Leu Gly Lys Leu Gly Gly Leu He Cys Trp Glu Asn 

165 170 175 

Tyr Met Pro Leu Ala Arg Tyr Thr Met Tyr Ala Trp Gly Thr Gin He 

180 185 190 

Tyr Val Ala Ala Thr Trp Asp Arg Gly Gin Pro Trp Leu Ser Thr Leu 

195 200 205 

Arg His He Ala Lys Glu Gly Arg Val Tyr Val He Gly Cys Cys He 

210 215 220 

Ala Met Arg Lys Asp Asp He Pro Asp Arg Tyr Ala Met Lys Gin Lys 
225 230 235 240 

Phe Tyr Ala Glu Ala Asp Glu Trp He Asn He Gly Asp Ser Ala He 

245 250 255 

Val Asn Pro Glu Gly Gin Phe He Ala Gly Pro Val Arg Lys Gin Glu 

260 265 270 

Glu He Leu Tyr Ala Glu He Asp Pro Arg Met Val Gin Gly Pro Lys 

275 280 285 

Trp Met Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe Gin 

290 295 300 

Leu Thr Val His Thr Asp Val Arg Gin Met He Arg Met Glu His Asp 
305 310 315 320 

Ser 



<210> 209 
<211> 993 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 209 

atgaaagtcg tcaaagccgc cgctgtccag atcagtcccg ttctctacag ccgtgaggca 60 

accgtcgaaa aggtcgtgaa gaaaattcac gagcttggcc aactgggcgt gcagttcgcc 120 

acctttcccg agaccgtagt gccttactac ccgtactttt ccgccgtcca gacaggcatt 180 

gagcttctgt ccggaactga gcatctgcgg ctgctcgatc aggccgtgac ggtaccgtct 240 

cccgctaccg atgcgatcgg agaggcggcc cgcaaggcgg gcatggtggt gtccatcggc 300 

gtgaatgaac gcgacggcgg caccttgtac aacacacagt tgctcttcga tgccgatggc 360 

accttgatcc agcgccgccg caagatcacg cccacccact tcgaacggat gatctggggc 420 

cagggggacg gctcgggcct gcgcgccgtc gacagcaagg ttggtcgcat tggtcagctt 480 

gcctgcttcg agcacaacaa cccgctggcc cgctacgcgc tgattgccga cggcgagcag 540 

atccattccg ccatgtatcc gggttctgct ttcggcgaag gctttgccca aaggatggaa 600 

atcaatatcc gccagcatgc gctggagtct ggtgcctttg tcgtcaacgc aacggcctgg 660 

ctggatgctg accagcaggc gcaaatcatc aaggacaccg gctgtgggat tggcccgatc 720 

tcgggcggct gcttcaccac gatcgtggca cccgacggca tgctgatggc cgaacctctg 780 
cgttcgggcg agggtgaggt catcgtggat ctcgacttca cgctgatcga ccgacgcaag 
atgttgatgg actcggcggg ccactataac cgtccagaac tgctcagtct catgattgac 



840 
900 
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cgtaccgcga cggcgcatgt tcacgaacgc gctgcgcatc cggtgtcggg cgcggagcag 960 
ggtccggagg atctgcgcac tccggccgcg tga 993 

<210> 210 
<211> 330 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 210 

Met Lys Val Val Lys Ala Ala Ala Val Gin lie Ser Pro Val Leu Tyr 

15 10 15 

Ser Arg Glu Ala Thr Val Glu Lys Val Val Lys Lys lie His Glu Leu 

20 25 30 

Gly Gin Leu Gly Val Gin Phe Ala Thr Phe Pro Glu Thr Val Val Pro 

35 40 45 

Tyr Tyr Pro Tyr Phe Ser Ala Val Gin Thr Gly He Glu Leu Leu Ser 

50 55 60 

Gly Thr Glu His Leu Arg Leu Leu Asp Gin Ala Val Thr Val Pro Ser 
65 70 75 80 

Pro Ala Thr Asp Ala He Gly Glu Ala Ala Arg Lys Ala Gly Met Val 

85 90 95 

Val Ser He Gly Val Asn Glu Arg Asp Gly Gly Thr Leu Tyr Asn Thr 

100 105 HO 

Gin Leu Leu Phe Asp Ala Asp Gly Thr Leu He Gin Arg Arg Arg Lys 

115 120 125 

He Thr Pro Thr His Phe Glu Arg Met He Trp Gly Gin Gly Asp Gly 

130 135 140 

Ser Gly Leu Arg Ala Val Asp Ser Lys Val Gly Arg He Gly Gin Leu 
145 150 155 160 

Ala Cys Phe Glu His Asn Asn Pro Leu Ala Arg Tyr Ala Leu He Ala 

165 170 175 

Asp Gly Glu Gin He His Ser Ala Met Tyr Pro Gly Ser Ala Phe Gly 

180 185 190 

Glu Gly Phe Ala Gin Arg Met Glu He Asn He Arg Gin His Ala Leu 

195 200 205 

Glu Ser Gly Ala Phe Val Val Asn Ala Thr Ala Trp Leu Asp Ala Asp 

210 215 220 

Gin Gin Ala Gin He He Lys Asp Thr Gly Cys Gly He Gly Pro He 
225 230 235 240 

Ser Gly Gly Cys Phe Thr Thr He Val Ala Pro Asp Gly Met Leu Met 

245 250 255 

Ala Glu Pro Leu Arg Ser Gly Glu Gly Glu Val He Val" Asp Leu Asp 

260 265 270 . 

Phe Thr Leu He Asp Arg Arg Lys Met Leu Met Asp Ser Ala Gly His 

275 280 285 

Tyr Asn Arg Pro Glu Leu Leu Ser Leu Met He Asp Arg Thr Ala Thr 

290 295 300 

Ala His Val His Glu Arg Ala Ala His Pro Val Ser Gly Ala Glu Gin 
305 310 315 320 

Gly Pro Glu Asp Leu Arg Thr Pro Ala Ala 
325 330 

<210> 211 
<211> 1062 
<212> DNA 
<213> Unknown 

<220> 
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<223> Obtained from an environmental sample 
<400> 211 

atgggcgtcg cacatccgaa atacaaagtg gccgccgtgc aggcagcgcc cgcttttctc 60 

gacctggacg cctcggtcga aaaggccgtc cgtttcatcg acgaagccgg cgccgccggc 120 

gcccgcctca tcgcctttcc ggagacctgg atacccggtt acccctggtg gatctggcta 180 

ggcgcgccgg cctgggctat catgcgcggc ttcgtctcgc gctatttcga caactcgctc 240 

agctacgaca gcccgcaggc cgagaagcte cgcgccgccg ccaagcgcaa caagatggtg 300 

gtggtgctcg gcctctccga gcgcgacggc ggcagccttt acatcgcgca atggatcatc 360 

ggcccggacg gcgaaaccat cgccaagcgc cgcaagctca agccgaccca cgcggagcgg 420 

accgtgttcg gcgaaggcga cggctcgcat cttgcggtgc acgagcttga tgttggccgg 480 

ctcggcgcgc tgtgctgctg ggaacacctg cagccgctgt ccaaatacgc catgtatgcg 540 

cagaacgaac aggtgcatgt cgcggcctgg ccgagctttt cgctttacga tccgttcgcg 600 

cacgcgctcg gcgcggaagt gaacaatgcg gcgagcaaaa tctatgcggt cgagggctcg 660 

tgtttcgtca tcgcgccgtg cgcgaccgtt tcgcaggcga tgatcgacga actctgcgat 720 

acgccggaga agcatcagtt cctgcatgcc ggcggcggct ttgccgtgat ttacggcccc 780 

gacggcgccc cgctcgcggc gccgctgccg cccgacaagg aaggcttgct ctacgccgac 8 40 

atcgatctcg ggatgatttc ggttgccaaa gcggcagccg atccggccgg gcattatgca 900 

cgccccgacg tcacccggct tctgttcaac aatcggcctg ggtatcgggt cgagaccatg 960 

gcgttgccga tcgatgcgga gaccaaggcg gaagcaccgg ctaagccgga acccaaggca 1020 

ccgaacgtgg cgccgttcgc gccggtgcaa gcggccgagt ga 1062 

<210> 212 
<211> 353 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 212 



Met Gly 


Val 


Ala 


His 


Pro 


Lys 


Tyr 


Lys 


Val 


Ala 


Ala Val Gin Ala Ala 


1 






5 










10 




15 


Pro Ala 


Phe 


Leu 
20 


Asp 


Leu 


Asp 


Ala 


Ser 
25 


Val 


Glu 


Lys Ala Val Arg Phe 
30 


He Asp 


Glu 


Ala 


Gly 


Ala 


Ala 


Gly 


Ala 


Arg 


Leu 


He Ala Phe Pro Glu 




35 










40 








45 , 
Leu Gly Ala Pro Ala 


Thr Trp 


He 


Pro 


Gly 


Tyr 


Pro 


Trp 


Trp 


He 


Trp 


50 










55 










60 


Trp Ala 


He 


Met 


Arg 


Gly 


Phe 


Val 


Ser 


Arg 


Tyr 


Phe Asp Asn Ser Leu 


65 








70 










75 


80 


Ser Tyr 


Asp 


Ser 


Pro 
85 


Gin 


Ala 


Glu 


Lys 


Leu 
90 


Arg 


Ala Ala Ala Lys Arg 
95 


Asn Lys 


Met 


Val 
100 


Val 


Val 


Leu 


Gly 


Leu 
105 


Ser 


Glu' 


Arg Asp Gly Gly Ser 
110 


Leu Tyr 


He 
115 


Ala 


Gin 


Trp 


He 


He 
120 


Gly 


Pro 


Asp 


Gly Glu Thr He Ala 
125 


Lys Arg 


Arg 


Lys 


Leu 


Lys 


Pro 


Thr 


His 


Ala 


Glu 


Arg Thr Val Phe Gly 


130 










135 










140 


Glu Gly 


Asp 


Gly 


Ser 


His 


Leu 


Ala 


Val 


His 


Glu 


Leu Asp Val Gly Arg 


145 








150 










155 


160 


Leu Gly 


Ala 


Leu 


Cys 
165 


Cys 


Trp 


Glu 


His 


Leu 
170 


Gin 


Pro Leu Ser Lys Tyr 
175 


Ala Met 


Tyr 


Ala 
180 


Gin 


Asn 


Glu 


Gin 


Val 
185 


His 


Val 


Ala Ala Trp Pro Ser 
190 


Phe Ser 


Leu 
195 


Tyr 


Asp 


Pro 


Phe 


Ala 
200 


His 


Ala 


Leu 


Gly Ala Glu Val Asn 
205 


Asn Ala 


Ala 


Ser 


Lys 


He 


Tyr 


Ala 


Val 


Glu 


Gly 


Ser Cys Phe Val He 


'210 










215 










220 


Ala Pro 


Cys 


Ala 


Thr 


Val 


Ser 


Gin 


Ala 


Met 


He 


Asp Glu Leu Cys Asp 


225 








230 










235 


240 
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Thr Pro Glu 


Lys 


His Gin Phe Leu 






245 


lie Tyr Gly 


Pro 


Asp Gly Ala Pro 




260 




Lys Glu Gly 


Leu 


Leu Tyr Ala Asp 


275 




280 


Ala Lys Ala 


Ala 


Ala Asp Pro Ala 


290 




295 


Thr Arg Leu 


Leu 


Phe Asn Asn Arg 


305 




310 


Ala Leu Pro 


lie 


Asp Ala Glu Thr 






325 


Glu Pro Lys 


Ala 


Pro Asn Val Ala 




340 




Glu 







His Ala Gly Gly Gly 


Phe 


Ala 


Val 


250 




255 




Leu Ala Ala Pro Leu 


Pro 


Pro 


Asp 


265 


270 






He Asp Leu Gly Met 


He 


Ser 


Val 


285 








Gly His Tyr Ala Arg 


Pro 


Asp 


Val 


300 








Pro Gly Tyr Arg Val 


Glu 


Thr 


Met 


315 






320 


Lys Ala Glu Ala Pro 


Ala Lys 


Pro 


330 




335 




Pro Phe Ala Pro Val 


Gin 


Ala 


Ala 


345 


350 







<210> 213 
<211> 993 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 213 

atgagagttg ttaaagccgc tgctgtccaa ctgagtcccg tcctctatag tcgcgaggga 60 

acggtcgaaa aggtcgtgcg gaagatccat gaacttgccg aagagggagt cgagttcgcc 120 

acctttcctg agaccgtggt gccttactac ccgtactttt ccttcgttca gacgcccttg 180 

gagcaaatct tcggaacaga gtatctgagg ctgctcgacc aggcagtcac cgtgccatcc 240 

cctgccaccg acgcgatcgg cgaggcagcc aggttcgctg gagttgttgt ctcgatcggc 300 

gtcaacgagc gagacggggg aactctatac aacactcagc ttctcttcga tgccgacggc 360 

aggataattc agcggcgccg caagatcacg cccacccatt acgagcgcat gatctggggc 420 

cagggcgacg gctcaggtct gcgggccgtt gatagcaagg ccggccgtat tggtcagctg 480 

gcatgctggg agcacaacaa tccactggcg cgctacgcgc tgatggccga cggcgagcag 540 

atccattccg ccatgtatcc gggctccatg ttcggcgact cgtttgccca gaagaccgaa 600 

atcaatatcc ggcagcatgc cctagagtct gggtgcttcg tcgtgaacgc aacggcctgg 660 

ctggacggcg atcagcaggc gcatatcatg aaggacaccg gctgcagcat cggcccgatc 720 

tccggcggtt gcttcactgc gatcgtcgca cccgatggta gcctgctggg cgaacccatc 780 

cgttccggtg agggcgtggt catcgccgac ctcgacttca cgttgatcga caggcgtaag 840 

caggtgatgg actcgcgagg ccattacagc cggccggagt tgctcagcct cttaatagac 900 

cgcaccccta ccgcgcactt tcacgaacgc gcttcgcccc ccacgacaga agctgagcaa 960 

ggctccgagg atgtgttcga ggctcgcatt taa 993 

<210> 214 
<211> 330 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 214 

Met Arg Val Val Lys Ala Ala Ala Val Gin Leu Ser Pro Val Leu Tyr 

15 10 15 

Ser Arg Glu Gly Thr Val Glu Lys Val Val Arg Lys He His Glu Leu 

20 25 30 

Ala Glu Glu Gly Val Glu Phe Ala Thr Phe Pro Glu Thr Val Val Pro 

35 40 45 

Tyr Tyr Pro Tyr Phe Ser Phe Val Gin Thr Pro Leu Glu Gin He Phe 
50 - 55 60 
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Glv Thr Glu Tyr Leu Arg Leu Leu Asp Gin Ala Val Thr Val Pro Ser 
65 70 75 80 

Pro Ala Thr Asp Ala lie Gly Glu Ala Ala Arg Phe Ala Gly Val Val 

85 90 95 

Val Ser lie Gly Val Asn Glu Arg Asp Gly Gly Thr Leu Tyr Asn Thr 

100 105 HO 

Gin Leu Leu Phe Asp Ala Asp Gly Arg lie He Gin Arg Arg Arg Lys 

115 120 125 

He Thr Pro Thr His Tyr Glu Arg Met He Trp Gly Gin Gly Asp Gly 

130 135 140 

Ser Gly Leu Arg Ala Val Asp Ser Lys Ala Gly Arg He Gly Gin Leu 
145 150 155 160 

Ala Cys Trp Glu His Asn Asn Pro Leu Ala Arg Tyr Ala Leu Met Ala 

165 170 175 

Asp Gly Glu Gin He His Ser Ala Met Tyr Pro Gly Ser Met Phe Gly 

180 185 190. 

Asp Ser Phe Ala Gin Lys Thr Glu He Asn He Arg Gin His Ala Leu 

195 200 205 

Glu Ser Gly Cys Phe Val Val Asn Ala Thr Ala Trp Leu Asp Gly Asp 

210 215 220 

Gin Gin Ala His He Met Lys Asp Thr Gly Cys Ser He Gly Pro He 
225 230 235 240 

Ser Gly Gly Cys Phe Thr Ala He Val Ala Pro Asp Gly Ser Leu Leu 

245 250 255 

Gly Glu Pro He Arg Ser Gly Glu Gly Val Val He Ala Asp Leu Asp 

260 265 270 

Phe Thr Leu He Asp Arg Arg Lys Gin Val Met Asp Ser Arg Gly His 

275 280 285 

Tyr Ser Arg Pro Glu Leu Leu Ser Leu Leu He Asp Arg Thr Pro Thr 

290 295 300 

Ala His Phe His Glu Arg Ala Ser Pro Pro Thr Thr Glu Ala Glu Gin 
305 310 315 320 

Gly Ser Glu Asp Val Phe Glu Ala Arg He 
325 330 

<210> 215 
<211> 1008 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 215 

atgagtatta cccaccccaa atttaaagct gctgttgttc aagctgcccc agtatttcta 60 

gacctagatg ggtctgttaa taaggcgatt aatctcattg atgaagctgc cgctgccgga 120 

gccaagctca ttgccttccc tgaaaccttc attccaggct atccatggtg gatttggctg 180 

ggatcgccgg cgtgggctct gggccggggg ttcgttcagc gttacttcga caattccctg 240 

cagtacgaca gcccgcaggc ggatcgctta cgcgaggcgg cacgacgcaa cagcattacg 300 

gtcgtgctgg gcttgtccga gcgtgatggc ggttctctct atatcgcaca gtggctgatc 360 

ggcccggatg gcgaaaccat cgcgcagcgg cgcaagcttc gtcctactca tggggagcgc 420 

acggtattcg gtgaagggga tggcagcgat ctggtggttc atcaaaccga actgggccgt 480 

cttggcgcgc ttaactgctg ggagaacatc ctgtctctga acaaatatgt gatgtactcc 540 

cagcatgaac aggtccatgt agcatcctgg cccagtttct cgacgtatga accgttcgcg 600 

catgcgctcg gctatgaggt aaacaacgca attagccagg tctatgcggt ggaaggcggg 660 

tgcttcgtgt tggccccgtg ctctaccatc tctgaagaaa tgattgccga actgtgcgat 720 

acacccgata aattcgagct gacgcatgct ggtggcggcc acgcaatcat ctatggtccg 780 

gacggtcgtg ctctgtgcga aaagctgccc gagaaccagg agggcctgct gtacgcggaa 840 

atcgatctgg gggtgatttc tatggcaaaa agtgccatgg atcctgtcgg ccattactct onn 
cgccccgatg tctaccgtgt gctgttcaat aagatcccgg caaagcgtat cgagcacttc 
aatttgccgt tggatgagca agcaggggaa gagccaccag ctgattaa 
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<210> 216 
<211> 335 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 216 

Met Ser He Thr His Pro Lys Phe Lys Ala Ala Val Val Gin Ala Ala 

15 10 15 

Pro Val Phe Leu Asp Leu Asp Gly Ser Val Asn Lys Ala He Asn Leu 

20 25 30 

He Asp Glu Ala Ala Ala Ala Gly Ala Lys Leu lie Ala Phe Pro Glu 

35 40 45 

Thr Phe He Pro Gly Tyr Pro Trp Trp He Trp Leu Gly Ser Pro Ala 

50 55 60 

Trp Ala Leu Gly Arg Gly Phe Val Gin Arg Tyr Phe Asp Asn Ser Leu 
65 70 75 80 

Gin Tyr Asp Ser Pro Gin Ala Asp Arg Leu Arg Glu Ala Ala Arg Arg 

85 90 95 

Asn Ser He Thr Val Val Leu Gly Leu Ser Glu Arg Asp Gly Gly Ser 

100 105 110 

Leu Tyr He Ala Gin Trp Leu He Gly Pro Asp Gly Glu Thr He Ala 

115 120 125 

Gin Arg Arg Lys Leu Arg Pro Thr His Gly Glu Arg Thr Val Phe Gly 

130 135 140 

Glu Gly Asp Gly Ser Asp Leu Val Val His Gin Thr Glu Leu Gly Arg 
145 150 155 160 

Leu Gly Ala Leu Asn Cys Trp Glu Asn He Leu Ser Leu Asn Lys Tyr 

165 170 175 

Val Met Tyr Ser Gin His Glu Gin Val His Val Ala Ser Trp Pro Ser 

180 185 190 

Phe Ser Thr Tyr Glu Pro Phe Ala His Ala Leu Gly Tyr Glu Val Asn 

195 200 205 

Asn Ala He Ser Gin Val Tyr Ala Val Glu Gly Gly Cys Phe Val Leu 

210 215 220 

Ala Pro Cys Ser Thr He Ser Glu Glu Met lie Ala Glu Leu Cys Asp 
225 230 235 240 

Thr Pro Asp Lys Phe Glu Leu Thr His Ala Gly Gly Gly His Ala He 

245 250 255 

He Tyr Gly Pro Asp Gly Arg Ala Leu Cys Glu Lys Leu Pro Glu Asn 

260 265 270 

Gin Glu Gly Leu Leu Tyr Ala Glu He Asp Leu Gly Val He Ser Met 

275 280 285 

Ala Lys Ser Ala Met Asp Pro Val Gly His Tyr Ser Arg Pro Asp Val 

290 295 300 

Tyr Arg Val Leu Phe Asn Lys He Pro Ala Lys Arg He Glu His Phe 
305 310 315 320 

Asn Leu Pro Leu Asp Glu Gin Ala Gly Glu Glu Pro Pro Ala Asp 
325 330 335 

<210> 217 
<211> 1011 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
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<400> 217 

gtgggtatca gccatccgaa attcaaggct gcggtcgtac aggccgggcc tgccttcctc 60 

gacctcgacg gcggcgtcga acgagccgtg tcgctcatcg gccaagcagc ggccgaaggg 120 

gcacagctga ttgcctttcc tgaaacgtgg attcccggtt acccgtggca cacctggctt 180 

ggcagcccgg cgtgggcaat ggaaaaaggc tttgtccaac gatatttcga caacgcgttg 240 

cggcatggtt ctccgcaagc cgagcgaatc tccggggctg cggcggagca caagattatg 300 

gtgtcgcttg ggtttgcgga acgcgatgga ggcacgcttt atatcgcgca gtggctcatc 360 

ggacccgacg gccaaactat ctcacgacgg cggaagctta agccgactca cgtcgagcgc 420 

actgtatttg gcgagggaga cggaagcgat ctctccgtgc atgatacggc gcttggacgt 480 

atcggctcac tttgttgctg ggagcatttg caaccgttgt cgaaatacgc aatgtacgcc 540 

cagaatgaac agattcacat tggcgcatgg cccagctttt cgctatacca gccatttgcg 600 

aatgcgctga gtcccgaagt caatatcgca gtaagccgcg tgtacgccgt ggaaggccag 660 

tgtttcttcc tcgcgccgtg cgcgacggtt tcggacgcca tgatcgaaac actgtgcgat 720 

acgcccgaaa agcagggact gattcgggcg ggtggcgggc acgccgcgat cttcggccca 780 

gatggaagtc tgctgacgcc tacggtagcg gatacttacg agggcctgct gtatgcagaa 840 

ctcgacctcg gcgtcatttc gatcgccaag agtgcagcgg accccgccgg ccactattcg 900 

cggccagatg tcacacgcct tctattgaat cagacgcctt cgaagcgcgt tcagaatatg 960 

gtgttaccac tggagacggt cacggagccc gaaggcccgg ttcagcccta g 1011 

<210> 218 
<211> 336 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 218 



Val Gly 


He 


Ser 


His 


Pro 


Lys 


Phe 


Lys Ala Ala Val 


Val 


Gin 


Ala 


Gly 


1 






5 








10 






15 




Pro Ala 


Phe 


Leu 


Asp 


Leu 


Asp 


Gly 


Gly Val Glu Arg 


Ala 


Val 


Ser 


Leu 






20 










25 




30 






lie Gly 


Gin 


Ala 


Ala 


Ala 


Glu 


Gly 


Ala Gin Leu He 


Ala 


Phe 


Pro 


Glu 


35 










40 




45 








Thr Trp 


He 


Pro 


Gly 


Tyr 


Pro 


Trp 


His Thr Trp Leu 


Gly 


Ser 


Pro 


Ala 


50 










55 




60 










Trp Ala 


Met 


Glu 


Lys 


Gly 


Phe 


Val 


Gin Arg Tyr Phe 


Asp 


Asn 


Ala 


Leu 


65 








70 






75 








80 


Arg His 


Gly 


Ser 


Pro 


Gin 


Ala 


Glu 


Arg He Ser Gly 


Ala 


Ala 


Ala 


Glu 








85 








90 






95 




His Lys 


He 


Met 


Val 


Ser 


Leu 


Gly 


Phe Ala Glu Arg 


Asp 


Gly Gly Thr 






100 










105 




110 






Leu Tyr 


He 


Ala 


Gin 


Trp 


Leu 


He 


Gly Pro Asp Gly 


Gin 


Thr 


He 


Ser 




115 










120 




125 








Arg Arg 


Arg 


Lys 


Leu 


Lys 


Pro 


Thr 


His Val Glu Arg 


Thr 


Val 


Phe 


Gly 


130 










135 




140 










Glu Gly 


Asp 


Gly 


Ser 


Asp 


Leu 


Ser 


Val His Asp Thr 


Ala 


Leu 


Gly Arg 


145 








150 






155 








160 


He Gly 


Ser 


Leu 


Cys 


Cys 


Trp 


Glu 


His Leu Gin Pro 


Leu 


Ser 


Lys 


Tyr 






165 








170 






175 




Ala Met 


Tyr 


Ala 


Gin 


Asn 


Glu 


Gin 


He His He Gly 


Ala 


Trp 


Pro 


Ser 




180 










185 




190 






Phe Ser 


Leu 


Tyr 


Gin 


Pro 


Phe 


Ala 


Asn Ala Leu Ser 


Pro 


Glu 


Val 


Asn 




195 










200 




205 








He Ala 


Val 


Ser 


Arg 


Val 


Tyr 


Ala 


Val Glu Gly Gin 


Cys 


Phe 


Phe 


Leu 


210 










215 




220 










Ala Pro 


Cys 


Ala 


Thr 


Val 


Ser 


Asp 


Ala Met He Glu 


Thr 


Leu 


Cys 


Asp 


225 






230 






235 








240 


Thr Pro 


Glu 


Lys 


Gin 


Gly 


Leu 


He 


Arg Ala Gly Gly 


Gly 


His 


Ala 


Ala 








245 








250 






255 




He Phe 


Gly 


Pro 


Asp 


Gly 


Ser 


Leu 


Leu Thr Pro Thr 


Val 


Ala Asp 


Thr 
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260 265 270 

Tvr Glu Gly Leu Leu Tyr Ala Glu Leu Asp Leu Gly Val He Ser He 

275 280 285 

Ala Lys Ser Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp Val 

290 295 300 

Thr Arg Leu Leu Leu Asn Gin Thr Pro Ser Lys Arg Val Gin Asn Met 
305 310 315 320 

Val Leu Pro Leu Glu Thr Val Thr Glu Pro Glu Gly Pro Val Gin Pro 
325 330 335 

<210> 219 
<211> 996 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 219 

atgaaaatcg tcaaagctgc cgccgttcaa attagtcccg tgctttatag ccgcgatgga 
acggtggaaa aagttgtgcg gaagatccat gagctcggcc aacagggagt gcagttcgca 
accttccccg agaccgtgat tccgtactat ccctattttt cctttctgca acccgcctac 
cagatcgcgg ccgggcagga gcacctcaag ttgctcgacc aggcggtgac ggtgccgtcc 
gctgccacgc acgcgatcgg ccaggcatgc aaacaggcag gagtggttgt ttcgatcggc 
atcaatgagc gggacaatgg aacgatctac aacacccagc ttctcttcga ctccgacgga 
accctgctgc agcggcgccg caagatctca cccacttacc acgaacggat gatctggggc 
cagggcgacg gttccggcct gcgcgccgtg gatagcaagg ccggccggat tgggcagctc 
qcctgttggg agcactacaa ccctttggcg cgctacgcga tgatggccga cggcgagcag 
atccattccg ccatgtaccc tggctccatc ttcggcgatc tgtttgccga acagacccag 
atcaacgtcc ggcagcacgc gttggagtcc ggctgtttcg tagtctgctc caccgcctgg 660 
ttggatcccg atcaacaagc gcaaatcgtc aaggacacag gcggcgccat tggtgcgatc /^u 
tcgggcggct gtttcacggc gattgtggcc cccgatagca ctctgctcgg agagccgatc 
cgctccggcg aaggcgtggt cattgcggat ctcgacttca cccggatcga caagcgtaaa 
cagttgatgg attcccgagg tcattacagc cggccggaat tgctgagtct gctgattgat 
cgcacttcca ccgcacatgt gcaaaaccgc gtgccggccg atccattcgg cgccggctgt 
gttgcagaac caggagtaga aacatgccca cgatag 

<210> 220 
<211> 331 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 

Met°Lvs 2 Ile Val Lys Ala Ala Ala Val Gin He Ser Pro Val Leu Tyr 

1 5 10 15 

Ser Arg Asp Gly Thr Val Glu Lys Val Val Arg Lys He His Glu J,eu 

20 25 30 

Glv Gin Gin Gly Val Gin Phe Ala Thr Phe Pro Glu Thr Val He Pro 

35 40 4 5 

Tyr Tyr Pro Tyr Phe Ser Phe Leu Gin Pro Ala Tyr Gin He Ala Ala 

50 55 60 

Glv Gin Glu His Leu Lys Leu Leu Asp Gin Ala Val Thr Val Pro Ser 
65 70 7 * 80 

Ala Ala Thr His Ala He Gly Gin Ala Cys Lys Gin Ala Gly Val Val 

85 90 95 

Val Ser He Gly He Asn Glu Arg Asp Asn Gly Thr He Tyr Asn Thr 

100 105 11° 

Gin Leu Leu Phe Asp Ser Asp Gly Thr Leu Leu Gin Arg Arg Arg Lys 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



720 
780 
840 
900 
960 
996 
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115 








120 


125 




lie Ser 


Pro 


Thr 


Tyr 


His 


Glu Arg Met He 


Trp Gly Gin Gly 


Asp Gly 


i "*n 










135 


140 




Ser Gly 


Leu Arg Ala Val 


Asp Ser Lys Ala 


Gly Arg He Gly 


Gin Leu 


1 4 S 








1 en 




155 


160 


Ala Cys 


Trp 


Glu 


His 


Tyr 


A^n Pr*rt TiPii Ala 

noli ±. \j u n -i- ci 


Arg Tyr Ala Met 


Met Ala 








165 




1 70 




175 


Asp Gly 


Glu 


Gin 


He 


His 


Cor Ala M<=»t" Tv/T* 


Pm fil w 5?pr Tip 


Phe Gly 




180 






IRS 


190 




nop liCU 


Phe 


Ala 


Glu 


Gin 


Thr (Zl n Tip Asn 

1111 vJ.il jlj>g no(t 


Val Arg Gin His 


Ala Leu 


1 ft c 
19o 








C.\J\J 


205 




Glu Ser 


Gly 


Cys 


Phe 


Val 


Val C\r q Q ca r* Thr 
vdx v^yo oci xiij. 


Ala Tto T.p\i Asn 


Pro Asd 














220 




bin tj±n 


Ala 


Gin 


He 


Val 


t,i/q Acrt Thr m u 


Glv Ala He Glv 


Ala He 










230 




235 


240 


Ser Gly 


Gly 


Cys 


Phe 


Thr 


Ala He Val Ala 


Pro Asp Ser Thr 


Leu Leu 








245 




250 




255 


Gly Glu 


Pro 


He 


Arg 


Ser 


Gly Glu Gly Val 


Val He Ala Asp 


Leu Asp 




260 






265 


270 




Phe Thr 


Arg 


He Asp Lys 


Arg Lys Gin Leu 


Met Asp Ser Arg 


Gly His 




275 








280 


285 




Tyr Ser 


Arg 


Pro 


Glu 


Leu 


Leu Ser Leu Leu 


He Asp Arg Thr 


Ser Thr 


290 










295 


300 




Ala His 


Val 


Gin Asn Arg 


Val Pro Ala Asp 


Pro Phe Gly Ala 


Gly Cys 


305 








310 




315 


320 


Val Ala 


Glu Pro Gly Val 


Glu Thr Cys Pro 


Arg 





325 330 



<210> 221 
<211> 1146 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 221 

atgttgacct acaaaggcgt gttcaaagca gcgacagttc aggcggagcc ggtgtggatg 60 

gacgctgacg ccaccattac caaagccatc cggatcatcg aagaggcggc tgataacggg 120 

gcgaagtttg tcgcattccc ggaggtgttc atccctggat acccttggtg gatctggctg 180 

ggcactgcga tgtggggagc caaattcgtc gtgcccttcc acgaaaattg ccttgagctt 2 40 

ggcgacaagc ggatgcagcg cattcaagct gcggcaaaac aaaatggcat tgcactcgtc 300 

atgggttacg gtgaacgtga cggaggtagc cgctacatga gccaggtatt catcgacgat 360 

agcggaaaaa ttgtggcgaa ccgccgcaag ctcaaaccga cgcacgaaga gcgaacgatt 420 

ttcggcgagg gaaacggatc cgatttcatc acccatgact tcccatttgc gagagtcgga 480 

ggcttcaact gctgggaaca ccttcagccc ctgagcaaat acatgatgta cagcctgcag 540 

gaacaggtgc atgtcgcgtc gtggccggcc atgtgcacct atcagcctga cgtgccccag 600 

ttaggtgcag gcgctaacga ggcggtcacc cgctcttacg cgatcgaagg cgcatgttat 660 

gtgctgggag cgacactcgt tatcggcaag gcagcccatg atgcattctg cgacactgaa 720 

gagcatcaca agctgcttgg catgggcgga gggtgggccc gaattttcgg cccggacggt 780 

gagtatctcg cagaatcgct cgctcatgac gcggagggca tcctgtacgc tgacattgat 840 

ctgagcaaga tcctgctcgc caaagcaaac acggacacag ttggccacta cgcgcgacct 900 

gacgtcttgt cattgctcgt caacacacac aatcccgggc ctgtacgcta tcttgacgaa 960 

gagggccggc aggtttcaac gtcaattcgc aggcacgaaa aactggaagg ccagagcctc 1020 

gacctcgagg ttacgccagc gacaccagcg accctggaca tcgcgagcct ggtacaacag 1080 

gctaagccat cgaccgttaa gagcgagagt aacgcgagca caaaacaacc ggatctcgcg 1140 

gtctga U 4 6 

<210> 222 
<211> 381 
<212> PRT 
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<213> Unknown 
<220> 

<223> Obtained from an environmental sample 
<400> 222 

Met Leu Thr Tyr Lys Gly Val Phe Lys Ala Ala Thr Val Gin Ala Glu 

15 10 15 

Pro Val Trp Met Asp Ala Asp Ala Thr lie Thr Lys Ala lie Arg lie 

20 25 30 

lie Glu Glu Ala Ala Asp Asn Gly Ala Lys Phe Val Ala Phe Pro Glu 

35 40 45 

Val Phe He Pro Gly Tyr Pro Trp Trp He Trp Leu Gly Thr Ala Met 

50 55 60 

Trp Gly Ala Lys Phe Val Val Pro Phe His Glu Asn Cys Leu Glu Leu 
65 70 75 80 

Gly Asp Lys Arg Met Gin Arg He Gin Ala Ala Ala Lys Gin Asn Gly 

85 90 95 

He Ala Leu Val Met Gly Tyr Gly Glu Arg Asp Gly Gly Ser Arg Tyr 

100 105 110 

Met Ser Gin Val Phe He Asp Asp Ser Gly Lys He Val Ala Asn Arg 

115 120 125 

Arg Lys Leu Lys Pro Thr His Glu Glu Arg Thr lie Phe Gly Glu Gly 

130 135 140 

Asn Gly Ser Asp Phe He Thr His Asp Phe Pro Phe Ala Arg Val Gly 
145 150 155 160 

Gly Phe Asn Cys Trp Glu His Leu Gin Pro Leu Ser Lys Tyr Met Met 

165 170 175 

Tyr Ser Leu Gin Glu Gin Val His Val Ala Ser Trp Pro Ala Met Cys 

180 185 190 

Thr Tyr Gin Pro Asp Val Pro Gin Leu Gly Ala Gly Ala Asn Glu Ala 

195 200 205 

Val Thr Arg Ser Tyr Ala He Glu Gly Ala Cys Tyr Val Leu Gly Ala 

210 215 220 

Thr Leu Val lie Gly Lys Ala Ala His Asp Ala Phe Cys Asp Thr Glu 
225 230 235 240 

Glu His His Lys Leu Leu Gly Met Gly Gly Gly Trp Ala Arg lie Phe 

245 250 255 

Gly Pro Asp Gly Glu Tyr Leu Ala Glu Ser Leu Ala His Asp Ala Glu 

260 265 ' 270 

Gly lie Leu Tyr Ala Asp He Asp Leu Ser Lys He Leu Leu Ala Lys 

275 280 285 

Ala Asn Thr Asp Thr Val Gly His Tyr Ala Arg Pro Asp Val Leu Ser 

. 290 295 300 

Leu Leu Val Asn Thr His Asn Pro Gly Pro Val Arg Tyr Leu Asp Glu 
305 310 315 320 

Glu Gly Arg Gin Val Ser Thr Ser lie Arg Arg His Glu Lys Leu Glu 

325 330 335 

Gly Gin Ser Leu Asp Leu Glu Val Thr Pro Ala Thr Pro Ala Thr Leu 

340 345 350 

Asp He Ala Ser Leu Val Gin Gin Ala Lys Pro Ser Thr Val Lys Ser 

355 360 365 

Glu Ser Asn Ala Ser Thr Lys Gin Pro Asp Leu Ala Val 
370 375 380 

<210> 223 
<211> 996 
<212> DNA 
<213> Unknown 

<220> 
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<223> Obtained from an environmental sample 



<400> 223 

atgagagttg 

acggtcgaaa 

gcgttccctg 

caaatccttc 

tatagcactg 

gtcaacgagc 

acgttgatcc 

cagggcgatg 

gcatgctttg 

atccattccg 

attgccgtaa 

ctcgatcccg 

atctccgggg 

atccgctcgg 

aaacatattg 

gatcgcactc 

ggagcgtcgg 



ttaaagccag 
aggtcgtagc 
agaccgtggt 
gcggcggtga 
aagccatcgg 
gtgatggcgg 
aaagacgacg 
gctcaggtct 
agcactacaa 
ccatgttccc 
ggcaacatgc 
gccagcaggc 
gttgcttcac 
gtgaaggcga 
tcgactcgag 
ccaccgcgca 
aacatcttcg 



cgcagttcaa 
gaagattcat 
gccttattac 
gttcgtcaag 
cgaagcctgc 
aacgatctac 
caagatcacc 
gcgcgctgtt 
ccctcttgcg 
cggttccatg 
gatggagtcc 
tcagatcgcc 
tgcgatcatc 
agtcatcgtc 
aggacattac 
ccttcacgac 
cgaagacgcc 



ctgaagcctg 
gagctagggc 
ccgtactttt 
ctcctcgatc 
aggcaggagg 
aacgcgcagt 
cccacccatt 
gacagcaatg 
cggtacgcga 
ttcggcgatg 
gggtgctttg 
aacgacaccg 
gcacccgatg 
gacctcgatt 
agccggccag 
cgcgctgtgc 
gcctga 



tactttatag 
agcagggtgt 
cgatcgtgca 
agtccgtgac 
agatggttgt 
tactctttga 
acgaacgcat 
tcgcacgcat 
tgatggccga 
gttttgcgga 
tcgtttgcgc 
gtatcaccga 
ggagcctgct 
tcacgttaat 
aattgctgag 
agcacaatgc 



ccgcgagggg 
gcagttcgcc 
gtccggctat 
ggtgccatct 
ctccataggc 
ttccgacggc 
gatctggggc 
tggtcaactg 
tggcgagcag 
gaggacggaa 
tacggcctgg 
catcggcccg 
gggccaacct 
tgacaagcgg 
cctgctgatc 
cggatcggaa 



<210> 224 
<211> 331 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
996 



<400> 224 

Met Arg Val Val Lys Ala Ser Ala Val Gin Leu Lys Pro Val Leu Tyr 

15 10 15 

Ser Arg Glu Gly Thr Val Glu Lys Val Val Ala Lys He His Glu Leu 

20 25 30 

Gly Gin Gin Gly Val Gin Phe Ala Ala Phe Pro Glu Thr Val Val Pro 

35 40 45 

Tyr Tyr Pro Tyr Phe Ser He Val Gin Ser Gly Tyr Gin He Leu Arg 

50 55 - 60 

Gly Gly Glu Phe Val Lys Leu Leu Asp Gin Ser Val Thr Val Pro Ser 
65 70 75 80 

Tyr Ser Thr Glu Ala He Gly Glu Ala Cys Arg Gin Glu Glu Met Val 

85 90 95 

Val Ser He Gly Val Asn Glu Arg Asp Gly Gly Thr He Tyr Asn Ala 

100 105 HO 

Gin Leu Leu Phe Asp Ser Asp Gly Thr Leu He Gin Arg Arg Arg Lys 

115 120 125 

He Thr Pro Thr His Tyr Glu Arg Met He Trp Gly Gin Gly Asp Gly 

130 135 140 

Ser Gly Leu Arg Ala Val Asp Ser Asn Val Ala Arg He Gly Gin Leu 
145 150 155 160 

Ala Cys Phe Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Met Met Ala 

165 170 175 

Asp Gly Glu Gin He His Ser Ala Met Phe Pro Gly Ser Met Phe Gly 

180 185 190 

Asp Gly Phe Ala Glu Arg Thr Glu He Ala Val Arg Gin His Ala Met 

195 200 205 

Glu Ser Gly Cys Phe Val Val Cys Ala Thr Ala Trp Leu Asp Pro Gly 

210 215 220 

Gin Gin Ala Gin lie Ala Asn Asp Thr Gly He Thr Asp He Gly Pro 
225 230 235 240 

He Ser Gly Gly Cys Phe Thr Ala He He Ala Pro Asp Gly Ser Leu 
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245 250 255 

Leu Gly Gin Pro lie Arg Ser Gly Glu Gly Glu Val lie Val Asp Leu 

260 265 270 

Asp Phe Thr Leu lie Asp Lys Arg Lys His lie Val Asp Ser Arg Gly 

275 280 285 

His Tyr Ser Arg Pro Glu Leu Leu Ser Leu Leu lie Asp Arg Thr Pro 

290 295 300 

Thr Ala His Leu His Asp Arg Ala Val Gin His Asn Ala Gly Ser Glu 
305 310 315 320 

Gly Ala Ser Glu His Leu Arg Glu Asp Ala Ala 
325 330 

<210> 225 
<211> 951 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 225 

atgaccaccg tcaaagccgc cgctgtccag atgagtcccg tgctttacag tcgcgacgat 60 

accatcgaga aaatttgtcg acagatcatc gagcttggcc gacagggagt gcagttcgcc 120 

acgtttccag agacagttat tccgtattat ccatatttcg ccttcgtgca gcggccttac 180 

gaaatgtcgg cccaatatca tcagttgctt gatcaagcgg tgaccgtgcc ttccggttca 240 

acgcacgcca ttggggccgc ctgtaaacaa gccggaattg ttgtctcaat cggcgtcaac 300 

gagcgggagg gcggcacgct ctatggcaca cagttgctgt ttgatgcaga cggcctcttg 360 

atccagcgtc gccgcaagat cactccgacc taccacgagc ggatgatttg gggacagggt 420 

gacggttccg ggctgcgagc cgtagatagc gcggtcggtc gaatcggaca gttggcgtgt 480 

tgggaacacc acaatccgct ggctcgttat gctttagcgg ccgatggcga acaaattcac 540 

gcggcgatgt accctggctc gatcttgggt gaactatttg ccgagcagat tcaggtcaac 600 

atccggcagc acgccatgga atctggttgc ttcgtcgtga acgccacggc ctggctaagc 660 

gaggaacagc aagcccgaat catgaaggac accggatcat tcgatagccc aatcaccggt 720 

ggttgcttta ccgccattgt cgcgcccaac gggcagataa tcggtgaacc gctgcgcatc 780 

ggcgaaggcg tcgtgattgc cgatttggac ttcgctttga ttgatgagag gaagcggctg 840 

atggactcac gcggcctcta tagccgccct gagttgctaa gcttgttaat cgacagaatg 900 

cctacatccc atgtgcatga acgggttgag cgtagcatgg cgatggcatg a 951 

<210> 226 
<211> 316 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 226 



Met Thr 


Thr 


Val 


Lys 


Ala 


Ala 


Ala 


Val 


Gin 


Met Ser 


Pro Val 


Leu 


Tyr 


1 






5 










10 






15 




Ser Arg 


Asp 


Asp 


Thr 


He 


Glu 


Lys 


He 


Cys 


Arg Gin 


He He 


Glu 


Leu 






20 










25 






30 






Gly Arg 


Gin 


Gly 


Val 


Gin 


Phe 


Ala 


Thr 


Phe 


Pro Glu 


Thr Val 


He 


Pro 




35 










40 








45 






Tyr Tyr 


Pro 


Tyr 


Phe 


Ala 


Phe 


Val 


Gin 


Arg 


Pro Tyr 


Glu Met 


Ser 


Ala 


50 










55 








60 








Gin Tyr 


His 


Glii 


Leu 


Leu Asp 


Gin 


Ala 


Val 


Thr Val 


Pro Ser Gly 


Ser 


65 








70 










75 






80 


Thr His 


Ala 


He 


Gly 


Ala 


Ala 


Cys 


Lys 


Gin 


Ala Gly 


He Val 


Val 


Ser 








85 








90 






95 




He Gly 


Val 


Asn 


Glu 


Arg 


Glu 


Gly 


Gly 


Thr 


Leu Tyr 


Gly Thr 


Gin 


Leu 



100 105 110 
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Leu Phe 


Asp Ala Asp 
115 


Gly 


Leu 


Leu lie Gin Arg Arg Arg 
ion i ot 


Lys 


He 


Thr 


Pro Thr 


Tyr His Glu 


Arg 


Met 


lie lrp uiy bin uiy Asp 


Gly 


Ser 


Gly 


130 






loo 


14 u 








Leu Arg 


Ala Val Asp 


Ser 


Ala 


vai uiy Arg lie tjiy win 


Leu 


Ala 


Cys 


145 




150 




loo 






160 


Trp Glu 


His His Asn 


Pro 


Leu 


Ala Arg Tyr Ala Leu Ala 


Ala Asp Gly 


165 






170 




175 




Glu Gin 


lie His Ala 


Ala 


Met 


Tyr Pro Gly ser lie jjeu 


Gly Glu Leu 




180 






185 


190 






Phe Ala 


Glu Gin lie 
195 


Gin 


Val 


Asn lie Arg Gin nis Aia 
200 20o 


Met 


Glu 


Ser 


Gly Cys 


Phe Val Val 


Asn 


Ala 


Thr Ala Trp Leu Ser Glu 


Glu 


Gin 


Gin 


210 






215 


220 








Ala Arg 


lie Met Lys 


Asp 


Thr 


Gly Ser Phe Asp Ser Pro 


He 


Thr 


Gly 


225 




230 










240 


Gly Cys 


Phe Thr Ala 


lie 


Val 


Ala Pro Asn Gly Gin lie 


He 


Gly Glu 


245 






250 




255 




Pro Leu 


Arg lie Gly 


Glu Gly 


Val Val lie Ala Asp Leu 


Asp 


Phe 


Ala 




260 






265 


270 






Leu lie 


Asp Glu Arg 


Lys 


Arg 


Leu Met Asp Ser Arg Gly 


Leu Tyr 


Ser 




275 






280 285 








Arg Pro 


Glu Leu Leu 


Ser 


Leu 


Leu lie Asp Arg Met Pro 


Thr 


Ser 


His 


290 






295 


300 








Val His 


Glu Arg Val 


Glu Arg 


Ser Met Ala Met Ala 








305 


310 




315 









<210> 227 
<211> 1035 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 227 

atgtccgatc gaegcategt tcgcgcggca gecattcaga tcgccccgga tctcgagcgc 60 

tgcagtgtaa cgctcgagaa ggtttgctcg gcgatcgacg aagccgcggg caaaggeget 120 

caactgagcg tctttcccga gaccttcgtc ccgtactacc cctatttctc gtttgtccgc 180 

cctccggtcg categgggge cgatcacatg eggctatacg aggaageggt ggtggtgccg 240 

ggtccc.gtga cgcaggctgt ttccgaacgt gcgcgcatgc atgggatggt ggtggtgctc 300 

ggcgtcaacg agcgcgacca cgggagcctc tacaacaccc aactgatttt egattgegat 360 

ggteggcteg ctctgaaacg ccgcaagatc acgccgacgt ttcacgagcg catgatctgg 420 

ggccagggcg acgccagtgg actgaaagtc gcgcgcacgg gtatcgggcg ggtcggcgcg 480 

ctcgcgtgct gggagcatta caacccgctc gcgcgctacg egctgatgae ccagcacgaa 540 

gagatccatt gcagtcaatt tccaggctcg ctcgtcggtc cgatcttctc cgagcagatg 600 

gaegtgaega tccgccatca cgccctcgaa teggggtget tcgtcgtgaa cgcaaccggt 660 

tggctcaegg aegegcagat cgcctcgatc accgatgacc cgaagcttca aegggegctg 720 

cgcggtggct gcaacacggc gategtttea ccggaaggcc agcatctggc ggagccgttg 780 

cgcgaaggcg aggggatggt ggtcgeggae ctcgacatgt cgctcatcac caagegcaag 840 

cgaatgatgg atteggtegg ccactatgcg cggccggaac tgttgagcct cgcgatcaac 900 

gatcgtcccg cagccccttt eggceggatg tgcgctgccg aagcaatgcg gggagecgae 960 

gaegtegtea cagtaggagc atttcatgag cgccagcgag aacgtgtcgg egaagagecg 1020 

gcaattgatg actga 1035 



<210> 228 
<211> 344 
<212> PRT 
<213> Unknown 

<220> 



Page 149 



WO 03/097810 PCT/US03/15712 



<223> Obtained from an environmental sample 
^40 0^ 228 

Met Ser Asp Arg Arg He Val Arg Ala Ala Ala lie Gin He Ala Pro 

Asp Leu Glu Arg Cys Ser Val Thr Leu Glu Lys Val Cys Ser Ala He 

20 25 30 

Asp Glu Ala Ala Gly Lys Gly Ala Gin Leu Ser Val Phe Pro Glu Thr 

35 40 45 

Phe Val Pro Tyr Tyr Pro Tyr Phe Ser Phe Val Arg Pro Pro Val Ala 

50 55 60 

Ser Gly Ala Asp His Met Arg Leu Tyr Glu Glu Ala Val Val Val Pro 
65 70 75 80 

Glv Pro Val Thr Gin Ala Val Ser Glu Arg Ala Arg Met His Gly Met 

85 90 95 

Val Val Val Leu Gly Val Asn Glu Arg Asp His Gly Ser Leu Tyr Asn 

100 105 HO 

Thr Gin Leu He Phe Asp Cys Asp Gly Arg Leu Ala Leu Lys Arg Arg 

115 120 125 

Lys He Thr Pro Thr Phe His Glu Arg Met lie Trp Gly Gin- Gly Asp 

130 135 140 

Ala Ser Gly Leu Lys Val Ala Arg Thr Gly He Gly Arg Val Gly Ala 
145 150 155 160 

Leu Ala Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Leu Met 

165 170 175 

Thr Gin His Glu Glu He His Cys Ser Gin Phe Pro Gly Ser Leu Val 

180 185 190 

Gly Pro He Phe Ser Glu Gin Met Asp Val Thr He Arg His His Ala 

195 200 205 

Leu Glu Ser Gly Cys Phe Val Val Asn Ala Thr Gly Trp Leu Thr Asp 

210 215 220 

Ala Gin He Ala Ser He Thr Asp Asp Pro Lys Leu Gin Arg Ala Leu 
225 230 235 240 

Ara Glv Gly Cys Asn Thr Ala He Val Ser Pro Glu Gly Gin His Leu 

245 250 255 

Ala Glu Pro Leu Arg Glu Gly Glu Gly Met Val Val Ala Asp Leu Asp 

260 265 270 

Met Ser Leu He Thr Lys Arg Lys Arg Met Met Asp Ser Val Gly His 

275 280 285 

Tvr Ala Arg Pro Glu Leu Leu Ser Leu Ala He Asn Asp Arg Pro Ala 

290 295 300 

Ala Pro Phe Gly Arg Met Cys Ala Ala Glu Ala Met Arg Gly Ala Asp 
305 310 315 320 

Asn Val Val Thr Val Gly Ala Phe His Glu Arg Gin Arg Glu Arg Val 

325 330 335 

Gly Glu Glu Pro Ala He Asp Asp 
340 

<210> 229 
<211> 975 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
^400^ 229 

atgcctgaaa acaaagtcaa agttgccctc gttcagcatc cgcctgtttt tctgaatttg 60 
ccgaaaacgt tggaaaaagt agaaggtttg gcgcgagagt gcgccgccaa tgaagcgaaa 120 
atcgttgtct tlcctgaalc ctggctgacc ggctatccgg tctggcttga tgaagcgccg 180 
aga?ccgcgc tgtgggatta tccgcccgcc aagcgtttgt atcaatacct cacggaaaat 240 
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tcattgcaga ttccgagcgc tgagtttgaa tctttgcgcg aaatcgctaa gaaaaattcc 300 

ctttatttag tcgttggagt tcacgaacga agcggcggaa cgctctacaa tacgataatt 360 

tatctcacgc ccgacggcag ttataaaact caccgcaaat tggttccaac ttacacggaa 420 

agacttgtct ggggcgcagg cgacggaagc ggtctaaatg ttgtggaaac gccttacggg 480 

attctcggag gtttgatttg ctgggaacat tggatgcctc tggctagggc ggcaatgcat 540 

tcaaaaaatg aagcgattca cgtttgccaa tttcccacgg ttcacgagcg acatcaaatc 600 

gccagccgtc attacgcctt cgaagggcag tgttttgtct tgacttccgg ttgcgcgatg 660 

acgaaaacgg atgttttgga aggttttgaa tcgctcgaaa caaacgacca cgaagttttc 720 

gggcttttgg attcgataga aaaggaagaa ctgatgcgtg gcggaagcgc gattattgcg 780 

cccgatttga gctattcggt cgagccggtt tttgacgaaa aaacgattgt ttacggcgaa 840 

ttaaatctcg atttaaccaa gcagggacat ctgtttttgg ataccgacgg acattattcg 900 

cgtcccgatg ttttcgagtt gcgcgtcaac gataaagcga accgaaacgt ccgttttgca 960 

tccgaaacag tatag 9^5 

<210> 230 

<211> 324 

<212> PRT 

<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 230 

Met Pro Glu Asn Lys Val Lys Val Ala Leu Val Gin His Pro Pro Val 

15 10 15 

Phe Leu Asn Leu Pro Lys Thr Leu Glu Lys Val Glu Gly Leu Ala Arg 

20 25 30 

Glu Cys Ala Ala Asn Glu Ala Lys He Val Val Phe Pro Glu Thr Trp 

35 40 45 

Leu Thr Gly Tyr Pro Val Trp Leu Asp Glu Ala Pro Arg Ala Ala Leu 

50 55 60 

Trp Asp Tyr Pro Pro Ala Lys Arg Leu Tyr Gin Tyr Leu Thr Glu Asn 
65 70 75 80 

Ser Leu Gin He Pro Ser Ala Glu Phe Glu Ser Leu Arg Glu He Ala 

85 90 95 

Lys Lys Asn Ser Leu Tyr Leu Val Val Gly Val His Glu Arg Ser Gly 

100 105 110 

Gly Thr Leu Tyr Asn Thr He He Tyr Leu Thr Pro Asp Gly Ser Tyr 

115 120 125 

Lys Thr His Arg Lys Leu Val Pro Thr Tyr Thr Glu Arg Leu Val Trp 

130 135 140 

Gly Ala Gly Asp Gly Ser Gly Leu Asn Val Val Glu Thr Pro Tyr Gly 
145 150 155 160 

He Leu Gly Gly Leu He Cys Trp Glu His Trp Met Pro Leu Ala Arg 

165 170 175 

Ala Ala Met His Ser Lys Asn Glu Ala He His Val Cys Gin Phe Pro 

180 185 190 

Thr Val His Glu Arg His Gin He Ala Ser Arg His Tyr Ala Phe Glu 

195 200 205 

Gly Gin Cys Phe Val Leu Thr Ser Gly Cys Ala Met Thr Lys Thr Asp 

210 215 220 

Val Leu Glu Gly Phe Glu Ser Leu Glu Thr Asn Asp His Glu Val Phe 
225 230 235 240 

Gly Leu Leu Asp Ser He Glu Lys Glu Glu Leu Met Arg Gly Gly Ser 

245 250 255 

Ala He He Ala Pro Asp Leu Ser Tyr Ser Val Glu Pro Val Phe Asp 

260 265 270 

Glu Lys Thr He Val Tyr Gly Glu Leu Asn Leu Asp Leu Thr Lys Gin 

275 280 285 

Gly His Leu Phe Leu Asp Thr Asp Gly His Tyr Ser Arg Pro Asp Val 
290 295 300 
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Phe Glu Leu Arg Val Asn Asp Lys Ala Asn Arg Asn Val Arg Phe Ala 
305 310 315 320 

Ser Glu Thr Val 



<210> 231 
<211> 1062 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 231 

atgagtcaga tccatccaaa acttaaggtc gcggccgtgc aggctgcccc cgcctttctc 60 

gatctcgatg catcgatcga aaagacaata cgctatgtcg acgaagcggc tgcggccggg 120 

gcgaagttga ttgcgtttcc ggaaacctgg attcccggct acccatggtg gatctggctc 180 

ggcgctccgg cctgggcgat catgcgtggc ttcgtctcgc gctatttcga caactcgctg 240 

caatatggca gtccggaagc tgaacggctg cgggacgccg ccaggcgcaa caagatatac 300 

atcgccctcg gcctgtccga gcgcgacggg ggcagtcttt atatcgcgca atggatcatc 360 

gggcctggcg gcgaaacggt tgcacaacgc cgcaagctca agcccacgca cgccgagcgc 420 

actgtattcg gcgaaggcga tggttcacat ctggccgtgc atgatctcga tattggaaga 480 

ttgggcgcgc tttgttgctg ggaacatctg caaccgttgt cgaaatatgc aatgtacgcc 540 

cagaacgagc aaattcacgt cgccgcctgg ccgagcttct cgctatacga tccctttgca '600 

cacgcactcg gcgccgaggt caataacgct gcgagcaaga tctatgcggt cgagggatcg 660 

tgcttcgtca ttgcgccgtg cgcaacggtt tcgcaggtga tgatcgatga gctctgcgat 720 

acccccgaaa agcatcaatt ccttcacgtc ggcggcggct tcgccgtcat ttacggtccc 7 80 

gacggctcgc cactggccaa acctcttccg ccagaccagg agggacttct ctatgccgac 840 

atcgatctcg gcatgatctc ggtcgccaag gccgcagccg atcccgccgg acattatgca 900 

cgtcccgatg taactcgcct gctgttcaac aatcgcccgg caaaccgcgt cgagaagctg 960 

gcgttgccgg tcgatcagga ggccgaagtg gatagtccgc tgaaggctcc cgacgcatct 1020 

cccaaagtga cggcgctcaa gccgtcgcag gctgcggagt ag 1062 

<210> 232 
<211> 353 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 232 



Met 


Ser 


Gin 


He 


His 


Pro 


Lys 


Leu 


Lys 


Val 


Ala Ala Val Gin Ala Ala 


1 








5 










10 


15 


Pro 


Ala 


Phe 


Leu 


Asp 


Leu 


Asp 


Ala 


Ser 


He 


Glu Lys Thr He Arg Tyr 








20 










25 




30 


Val 


Asp 


Glu 


Ala 


Ala 


Ala 


Ala 


Gly 


Ala 


Lys 


Leu He Ala Phe Pro Glu 




35 










40 






45 


Thr 


Trp 


lie 


Pro 


Gly 


Tyr 


Pro 


Trp 


Trp 


He 


Trp Leu Gly Ala Pro Ala 




50 










55 








60 


Trp 


.Ala 


He 


Met 


Arg 


Gly 


Phe 


Val 


Ser 


Arg 


Tyr Phe Asp Asn Ser Leu 


65 










70 










75 80 


Gin 


Tyr 


Gly 


Ser 


Pro 


Glu 


Ala 


Glu 


Arg 


Leu 


Arg Asp Ala Ala Arg Arg 






85 










90 


95 


Asn 


Lys 


He 


Tyr 


He 


Ala 


Leu 


Gly 


Leu 


Ser 


Glu Arg Asp Gly Gly Ser 






100 










105 




110 


Leu 


Tyr 


He 


Ala 


Gin 


Trp 


He 


He 


Gly 


Pro 


Gly Gly Glu Thr Val Ala 




115 










120 






125 


Gin 


Arg 


Arg 


Lys 


Leu 


Lys 


Pro 


Thr 


His 


Ala 


Glu Arg Thr Val Phe Gly 




130 










135 








140 


Glu 


Gly 


Asp 


Gly 


Ser 


His 


Leu 


Ala 


Val 


His 


Asp Leu Asp He Gly Arg 



Page 152 



WO 03/097810 



PCT/US03/15712 



145 






150 


Leu Gly 


Ala 


Leu Cys Cys 








165 


Ala 


Met 


Tvr 


Ala Gin Asn 








180 


Phe 


Ser 


Leu 


Tyr Asp Pro 






195 




Asn 


Ala 


Ala 


Ser Lys lie 




210 






Ala 


Pro 


Cys 


Ala Thr Val 


225 






230 


Thr 


Pro 


Glu 


Lys His Gin 








245 


lie 


Tyr 


Gly 


Pro Asp Gly 








260 


Gin 


Glu 


Gly 


Leu Leu Tyr 






275 




Ala 


Lys 


Ala 


Ala Ala Asp 




290 






Thr 


Arg 


Leu 


Leu Phe Asn 


305 




310 


Ala 


Leu 


Pro 


Val Asp Gin 








325 


Pro Asp 


Ala 


Ser Pro Lys 








340 


Glu 









Trp Glu 

Glu Gin 

Phe Ala 
200 
Tyr Ala 
215 

Ser Gin 

Phe Leu 

Ser Pro 

Ala Asp 
280 
Pro Ala 
295 

Asn Arg 
Glu Ala 
Val Thr 



155 

His Leu Gin Pro 
170 

lie His Val Ala 
185 

His Ala Leu Gly 

Val Glu Gly Ser 
220 

Val Met lie Asp 
235 

His Val Gly Gly 
250 

Leu Ala Lys Pro 
265 

lie Asp Leu Gly 

Gly His Tyr Ala 
300 

Pro Ala Asn Arg 
315 

Glu Val Asp- Ser 
330 

Ala Leu Lys Pro 
345 



160 

Leu Ser Lys Tyr 
175 

Ala Trp Pro Ser 
190 

Ala Glu Val Asn 
205 

Cys Phe Val He 

Glu Leu Cys Asp 
240 

Gly Phe Ala Val 
255 

Leu Pro Pro Asp 
270 

Met He Ser Val 
285 

Arg Pro Asp Val 

Val Glu Lys Leu 
320 

Pro Leu Lys Ala 
335 

Ser Gin Ala Ala 
350 



<210> 233 
<211> 1002 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 233 

gtgagcacca tcgtcaaagc cgcggccgtg cagatcagcc ccgtgctgta cagccgcgag 60 

ggcaccgtcg agcgggtcgt gaagaagatc cgggagctgg gcgaaaaggg cgtccagttc 120 

gccaccttcc ccgagaccgt catcccttac tacccgtact tttccttcgt tcagacgccc 180 

ttgcagatcc tcgccggccc cgagcatctg aagctgctcg accagtcggt gaccgtgccg 240 

tcccccgcca cggacgcgat cggccaggcc gcccggcagg caggaatggt ggtgtccatc 300 

ggcgtcaacg agcgtgacgg cggcaccctg tacaacacgc agctgctctt cgacgcggac 360 

ggcgcgctga tccagcgtcg ccgcaagatc aagcccaccc actacgagcg catgatctgg 420 

ggcgagggcg acggctccgg cctgcgcgcc gtcgacagcc aggtcggtcg tatcggccag 480 

ctggcctgct gggagcacaa caaccccctg gcgcgctacg ccatgatggc cgacggcgag 540 

cagatccatt cggccatgta tccgggctcg atgttcggcg acccgttcgc ccagaagacg 600 

gaaatcaaca tccggcagca tgcgctggaa tccggatgct tcgtcgtctg ctcgacggcc 660 

tggttggacg ccgatcagca ggcgcaaatc atgcaggaca cgggctgcgc catcggcccg 720 

atctcgggcg gctgcctcac ggcgatcgtg gcgcccgacg gcacgttcct gggcgaaccg 780 

ctcacgtcgg gcgagggcga ggtcatcgcc gacctcgatt tcaagctgat cgacaagcgc 840 

aagcagacga tggactcgcg cggccactac aaccgccccg aactgctcag cctgctgatc 900 

gatcgaacgc cgacgtcgaa cgtocatgag cgcgccgcgc acccgaaggt cgaggcgtca 960 

caaacggctg gcgacacgga gcggacccgc gaggtcctgt aa 1002 

<210> 234 

<211> 333 

<212> PRT 

<213> Unknown 

<220> 
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<223> Obtained from an environmental sample 
<400> 234 

Val Ser Thr lie Val Lys Ala Ala Ala Val Gin He Ser Pro Val. Leu 

15 10 15 

Tyr Ser Arg Glu Gly Thr Val Glu Arg Val Val Lys Lys He Arg Glu 

20 25 30 

Leu Gly Glu Lys Gly Val Gin Phe Ala Thr Phe Pro Glu Thr Val He 

35 40 45 

Pro Tyr Tyr Pro Tyr Phe Ser Phe Val Gin Thr Pro Leu Gin He Leu 

50 55 60 

Ala Gly Pro Glu His Leu Lys Leu Leu Asp Gin Ser Val Thr Val Pro 
65 70 75 80 

Ser Pro Ala Thr Asp Ala He Gly Gin Ala Ala Arg Gin Ala Gly Met 

85 90 95 

Val Val Ser He Gly Val Asn Glu Arg Asp Gly Gly Thr Leu Tyr Asn 

100 105 110 

Thr Gin Leu Leu Phe Asp Ala Asp Gly Ala Leu He Gin Arg Arg Arg 

115 120 125 

Lys He Lys Pro Thr His Tyr Glu Arg Met He Trp Gly Glu Gly Asp 

130 135 140 

Gly Ser Gly Leu Arg Ala Val Asp Ser Gin Val Gly Arg lie Gly Gin 
145 150 155 160 

Leu Ala Cys Trp Glu His Asn Asn Pro Leu Ala Arg Tyr Ala Met Met 

165 170 175 

Ala Asp Gly Glu Gin He His Ser Ala Met Tyr Pro Gly Ser Met Phe 

180 185 190 

Gly Asp Pro Phe Ala Gin Lys Thr Glu He Asn He Arg Gin His Ala 

195 200 205 

Leu Glu Ser Gly Cys Phe Val Val Cys Ser Thr Ala Trp Leu Asp Ala 

210 215 220 

Asp Gin Gin Ala Gin He Met Gin Asp Thr Gly Cys Ala He Gly Pro 
225 230 235 240 

He Ser Gly Gly Cys Leu Thr Ala He Val Ala Pro Asp Gly Thr Phe 

245 250 255 

Leu Gly Glu Pro Leu Thr Ser Gly Glu Gly Glu Val He Ala Asp Leu 

260 265 270 

Asp Phe Lys Leu He Asp Lys Arg Lys Gin Thr Met Asp Ser Arg Gly 

275 280 285 

His Tyr Asn Arg Pro Glu Leu Leu Ser Leu Leu He Asp Arg Thr Pro 

290 295 300 

Thr Ser Asn Val His Glu Arg Ala Ala His Pro Lys Val Glu Ala Ser 
305 310 315 320 

Gin Thr Ala Gly Asp Thr Glu Arg Thr Arg Glu Val Leu 
325 330 

<210> 235 
<211> 993 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 235 

atgaaaattg ttaaagcggc agcagttcag ctgagccccg tcctctatag ccgcgagggg 60 

acggtcgaaa gagtagtgcg gaagattcat caacttggtc aacagggagt gcagtttgcc 120 

accttcccgg aaacagtggt gccttactac ccgtattttt cgatcgtgca gtccgggtat 180 

caaatccttc gcggcggtga gttcgtgaag ctgctcgatc agtcagtgac ggtgccatct 240 

cttgccaccg aagcgatcgc cgaggcctgc aggcaggcgg gcgtcgttgt ctccatcggc 300 

gtcaatgagc gtgacggcgg aactatatac aatgcgcagc ttctgtttga ttcggacggc 360 
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acattgattc agaggcgacg caagatcacg cccacccact acgagcgcat gatctggggc 420 

cagggcgatg gctcgggtct gcgggctgtg gacagcaagg tggcacgtat tggtcaactg 480 

gcgtgctttg agcattacaa ccccctcgca cgatacgcga tgatcgccga tggcgagcag 540 

atccactctg caatgtttcc cggttccatg ttcggcgatg gtttcgcgga gaggaccgag 600 

atcgcggtca ggcagcatgc gcaggagtcc ggatgctttg tagtttgtgc tacggcgtgg 660 

ctggatgccg accagcaggc tcaaattgcc gcggacacag gcatcaccga cctgggaccg 720 

atctccggcg gttgcttcac tgcgatcatt gcacctgatg ggagcctgct gggtcaacca 780 

atccgctcgg gcgaaggtga cgtcattgtc gatctcgatt tcactctgat cgacaggcgg 840 

aagcatgttg tggactcgag aggtcactac agccggccgg aattgctaag cctgctgatc 900 

gaccgtactc ccacagcgca cgttcacgaa cgggccgcgc actctcactt ggccgccgag 960 

caatgcttgg aggatcttaa cgcgcttgct taa 993 

<210> 236 
<211> 330 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 236 



Met 


Lys 


He 


Val 


Lys 


Ala 


Ala 


Ala 


Val Gin Leu Ser Pro Val Leu Tyr 










5 








10 15 


Ser 


Arg 


Glu Gly 


Thr 


Val 


Glu 


Arg 


Val Val Arg Lys He His Gin Leu 








20 










25 30 


Glv 


Gin 


Gin 


Gly 


Val 


Gin 


Phe 


Ala 


Thr Phe Pro Glu Thr Val Val Pro 






35 










40 


45 




Tvt 


Pro 


Tyr 


Phe 


Ser 


He 


Val 


Gin S&r Glv Tvr Gin He Leu Ara 




50 










55 




60 


Gly 


Gly 


Glu 


Phe 


Val 


Lys 


Leu 


Leu 


Asp Gin Ser Val Thr Val Pro Ser 


65 










70 






75 80 


Leu 


Ala 


Thr 


Glu 


Ala 


He 


Ala 


Glu 


Ala Cys Arg Gin Ala Gly Val Val 










85 








90 95 


Val 


Ser 


He Gly 


Val 


Asn 


Glu 


Arg 


Asp Gly Gly Thr He Tyr Asn Ala 








100 










105 110 


Gin 


Leu 


Leu 


Phe 


Asp 


Ser 


Asp 


Gly 


Thr Leu He Gin Arg Arg Arg Lys 






115 










120 


125 


lie 


Thr 


Pro 


Thr 


His 


Tyr 


Glu 


Arg 


Met He Trp Gly Gin Gly Asp Gly 




130 










135 




140 


Ser 


Gly 


Leu 


Arg 


Ala 


Val 


Asp 


Ser 


Lys Val Ala Arg He Gly Gin Leu 


145 










150 






155 160 


Ala 


Cys 


Phe 


Glu 


His 


Tyr 


Asn 


Pro 


Leu Ala Arg Tyr Ala Met He Ala 










165 








170 175 


Asp 


Gly 


Glu 


Gin 


lie 


His 


Ser 


Ala 


Met Phe Pro Gly Ser Met Phe Gly 








180 










185 190 


Asp 


Gly 


Phe 


Ala 


Glu 


Arg 


Thr 


Glu 


He Ala Val Arg Gin His Ala Gin 






195 










200 


205 


Glu 


Ser 


Gly Cys 


Phe 


Val 


Val 


Cys 


Ala Thr Ala Trp Leu Asp Ala Asp 




210 










215 




220 


Gin 


Gin 


Ala 


Gin 


He 


Ala 


Ala 


Asp 


Thr Gly He Thr Asp Leu Gly Pro 


225 










230 






235 240 


He 


Ser 


Gly Gly 


Cys 


Phe 


Thr 


Ala 


He He Ala Pro Asp Gly Ser Leu 










245 








250 255 


Leu 


Gly 


Gin 


Pro 


He 


Arg 


Ser 


Gly 


Glu Gly Asp Val He Val Asp Leu 








260 










265 270 


Asp 


Phe 


Thr 


Leu 


He 


Asp 


Arg 


Arg 


Lys His Val Val Asp Ser Arg Gly 






275 










280 


285 


His 


Tyr 


Ser Arg 


Pro 


Glu 


Leu 


Leu 


Ser Leu Leu He Asp Arg Thr Pro 




290 










295 




300 


Thr 


Ala 


His 


Val 


His 


Glu 


Arg 


Ala 


Ala His Ser His Leu Ala Ala Glu 


305 










310 






315 320 
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Gin Cys Leu Glu Asp Leu Asn Ala Leu Ala 
325 330 

<210> 237 
<211> 993 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 237 

atgaaagtcg tcaaagccgc cgcagtgcag atcagtcctg ttctctacag ccgagaggca 60 

accgtcgcca aggtcgtgca gaagatccac gaactcggcc agaaaggcgt gcagttcgcc 120 

acctttccag agaccgtagt gccttactac ccgtactttt ccgccgtcca gacaggcatc 180 

gagcttctgt ccggcacgga gcatctccgg ctgctcgatc aggccgtgac ggtgccgtct 240 

gccgctaccg acgcgatcgg agaggcagcc cggaaggcag gcatggtggt gtcgatcggc 300 

gtcaatgaac gcgatggcgg caccttgtac aacacacagt tgctcttcga tgccgatggc 360 

accttgatcc agcgccgccg caagatcacg ccgacccact tcgaacgcat gatctggggc 420 

cagggggacg gttcgggcct gcgcgctgtc gacagcaagg tcggtcgcat tggccagttg 480 

gcctgcttcg agcacaacaa cccgctggcc cgctacgcgt tgattgccga cggcgagcag 540 

atccattccg ccatgtatcc gggttctgct tttggcgaag gatttgccca aaggatggaa 600 

atcaatatcc gccagcatgc gctggagtcg ggtgcgttcg tcgtcaacgc aacggcctgg 660 

ctggatgctg accagcaggc gcagatcatg aaggacaccg gctgcgggat tggcccgatc 720 

tcgggcggtt gcttcaccac gatcgtgtca cccgacggca tgctgatggc cgaacccctg 780 

cgctcgggcg agggtgaggt catcgtcgat ctcgacttca cgctgatcga ccgtcgcaag 840 

atgttgatgg actcggcggg ccactataac cgcccggaac tgctcagtct catgatcgac 900 

cgcaccccga ccgcgcacgt tcatgaacgc gctgcgcgtc cggtgtcggg cgttgagcag 960 

aacccggagg aacttcgcat cccggccgcg tga 993 

<210> 238 
<211> 330 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 238 



Met 


Lys 


Val 


Val 


Lys 


Ala 


Ala 


Ala 


Val 


Gin 


He Ser Pro Val Leu Tyr 


1 






5 










10 


15 


Ser 


Arg 


Glu 


Ala 


Thr 


Val 


Ala 


Lys 


Val 


Val 


Gin Lys He His Glu Leu 






20 










25 




30 


Gly 


Gin 


Lys 


Gly 


Val 


Gin 


Phe 


Ala 


Thr 


Phe 


Pro Glu Thr Val Val Pro 




35 










40 






45 


Tyr 


Tyr 


Pro 


Tyr 


Phe 


Ser 


Ala 


Val 


Gin 


Thr 


Gly He Glu Leu Leu Ser 


50 








55 








60 


Gly 


Thr 


Glu 


His 


Leu 


Arg 


Leu 


Leu 


Asp 


Gin 


Ala Val Thr Val Pro Ser 


65 










70 










75 80 


Ala 


Ala 


Thr 


Asp 


Ala 


He 


Gly 


Glu 


Ala 


Ala 


Arg Lys Ala Gly Met Val 








85 










90 


95 


Val 


Ser 


He 


Gly 


Val 


Asn 


Glu 


Arg 


Asp Gly 


Gly Thr Leu Tyr Asn Thr 








100 










105 




110 


Gin 


Leu 


Leu 


Phe 


Asp 


Ala 


Asp 


Gly 


Thr 


Leu 


He Gin Arg Arg Arg Lys 






115 










120 






125 


He 


Thr 


Pro 


Thr 


His 


Phe 


Glu 


Arg 


Met 


He 


Trp Gly Gin Gly Asp Gly 




130 










135 








140 


Ser 


Gly 


Leu 


Arg 


Ala 


Val 


Asp 


Ser 


Lys 


Val 


Gly Arg He Gly Gin Leu 


145 






150 










155 160 


Ala 


Cys 


Phe 


Glu 


His 


Asn 


Asn 


Pro 


Leu 


Ala 


Arg Tyr Ala Leu He Ala 








165 










170 


175 
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Asp 


Gly 


Glu 


Gin 
180 


lie 


His 


Ser 


Ala 


Met 
185 


Tyr 


Pro Gly 


Ser Ala Phe Gly 
190 


Glu Gly 


Phe 


Ala 


Gin 


Arg 


Met 


Glu 


He 


Asn 


He Arg 


Gin His Ala Leu 






195 










200 








205 


Glu 


Ser 


Gly Ala 


Phe 


Val 


Val 


Asn 


Ala 


Thr 


Ala Trp 


Leu Asp Ala Asp 




210 










215 








220 




Gin 


Gin 


Ala 


Gin 


lie 


Met 


Lvs 


Asp 


Thr 


Gly 


Cys Gly 


He Gly Pro He 


225 










230 










235 


240 


Ser Gly 


Gly Cys 


Phe 


Thr 


Thr 


He 


Val 


Ser 


Pro Asp 


Gly Met Leu Met 










245 










250 




255 


Ala 


Glu 


Pro 


Leu 
260 


Arg 


Ser 


Gly 


Glu 


Gly 
265 


Glu 


Val He 


Val Asp Leu Asp 
270 


Phe 


Thr 


Leu 

275 


lie 


Asp 


Arg 


Arg 


Lys 
280 


Met 


Leu 


Met Asp 


Ser Ala Gly His 
285 


Tyr 


Asn 


Arg 


Pro 


Glu 


Leu 


Leu 


Ser 


Leu 


Met 


He Asp 


Arg Thr Pro Thr 


290 










295 








300 




Ala 


His 


Val 


His 


Glu 


Arg 


Ala 


Ala 


Arg 


Pro 


Val Ser 


Gly Val Glu Gin 


305 










310 










315 


320 


Asn 


Pro 


Glu 


Glu 


Leu 

325 


Arg 


He 


Pro 


Ala 


Ala 
330 







<210> 239 
<211> 969 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 239 

atgtccaatg agaataagta tgacacattt aaagttgctg cagtccaggc cacacctgtg 60 

tttcttgatc gtgaagcaac catcgacaaa gcttgcgagt tgattgctac tgcaggtcgt 120 

gaaggtgctc gcctgattgt ctttccagaa gcgttcatcc catcctatcc cgagtgggta 180 

tggggtattc cctctggtga gcaaggttta ctcaacgaac tttattcaga gttgctcacc 240 

aatgcggtta ccatacccag cgacgcgact gacagattgt gcgaggcggc aaagcttgcg 300 

aatgcctatg tagtgatggg aatgagcgaa cggaatgtcg aagcgagtgg tgcaagcctg 360 

tataacacga tgttgtatat agatgcacag ggggagattt tagggaaaca tcggaagctg 420 

gtgccaacgg gtggtgaacg cctggtatgg gcgcaaggtg atggcagcac gctgcaggtc 480 

tacgatactc cattgggaaa acttggtggg ttaatttgct gggaaaatta tatgccgctg 540 

gcacgctata cgatgtatgc ctggggaaca caaatctatg ttgcagcaac gtgggattgc 600 

ggccaaccct ggctctcaac gatacggcat attgctaaag aaggcagggt atacgtggtt 660 

ggttgctgta tcgcgatgcg taaagatgat attccagatc gttactctat gaagcagaaa 720 

tattatgctg aaatggatga atggataaat gttggggata gcgcgattgt caatcccgaa 780 

ggacatttta ttgcagggcc tgtgcgcaag caagaagaaa ttctctatgc ggagattgat 840 

ccacgtatga tgcaaggccc gaagtggatg cttgacgtgg cgggacatta tgcaagacca 900 

gatgtgttcc agttgacggt gcatacggat gtgaggcaga tgatacgggt ggaagatgat 960 

tctcaatga 969 

<210> 240 
<211> 322 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 240 

Met Ser Asn Glu Asn Lys Tyr Asp Thr Phe Lys Val Ala Ala Val Gin 

15 10 15 

Ala Thr Pro Val Phe Leu Asp Arg Glu Ala Thr He Asp Lys Ala Cys 
20 25 30 
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Glu Leu He Ala Thr Ala Gly Arg Glu Gly Ala Arg Leu He Val Phe 

35 40 45 

Pro Glu Ala Phe He Pro Ser Tyr Pro Glu Trp Val Trp Gly He Pro 

50 55 60 

Ser Gly Glu Gin Gly Leu Leu Asn Glu Leu Tyr Ser Glu Leu Leu Thr 
65 70 75 80 

Asn Ala Val Thr He Pro Ser Asp Ala Thr Asp Arg Leu Cys Glu Ala 

85 90 95 

Ala Lys Leu Ala Asn Ala Tyr Val Val Met Gly Met Ser Glu Arg Asn 

100 105 110 

Val Glu Ala Ser Gly Ala Ser Leu Tyr Asn Thr Met Leu Tyr He Asp 

115 120 125 

Ala Gin Gly Glu He Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly 

130 135 140 

Gly Glu Arg Leu Val Trp Ala Gin Gly Asp Gly Ser Thr Leu Gin Val 
145 150 155 160 

Tyr Asp Thr Pro Leu Gly Lys Leu Gly Gly Leu He Cys Trp Glu Asn 

165 170 175 

Tyr Met Pro Leu Ala Arg Tyr Thr Met Tyr Ala Trp Gly Thr Gin He 

180 185 190 

Tyr Val Ala Ala Thr Trp Asp Cys Gly Gin Pro Trp Leu Ser Thr He 

195 200 205 

Arg His He Ala Lys Glu Gly Arg Val Tyr Val Val Gly Cys Cys He 

210 215 220 

Ala Met Arg Lys Asp Asp He Pro Asp Arg Tyr Ser Met Lys Gin Lys 
225 230 235 240 

Tyr Tyr Ala Glu Met Asp Glu Trp He Asn Val Gly Asp Ser Ala He 

245 250 255 

Val Asn Pro Glu Gly His Phe He Ala Gly Pro Val Arg Lys Gin Glu 

260 265 270 

Glu He Leu Tyr Ala Glu He Asp Pro Arg Met Met Gin Gly Pro Lys 

275 280 285 

Trp Met Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe Gin 

290 295 300 

Leu Thr Val His Thr Asp Val Arg Gin Met He Arg Val Glu Asp Asp 
305 310 315 320 

Ser Gin 



<210> 241 
<211> 972 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 241 

atgatatcca acgagcataa caatactcca ttcaaggttg ctgctgtgca ggctacacct 60 

gtgtttcttg atcgcgaagc aacgatcgat aaagcgtgtg aactgatcgc tactgccggt 120 

catgaaggcg ctcgtttgat tgtttttcca gaagcgttca tcccatccta tcccgagtgg 180 

gtatggggaa ttccctctgg cgagcaaggt ttgctcaacg atctgtatgc agagttactc 240 

accaattcag ttacgatacc cagcaacgca actgacaggc tttgtagagc cgcgaagctt 300 

gctaatgcct acgtggtgat ggggatgagc gaacggaata tcgaagcgag cggcgcaagc 360 

ctgtacaata cgatgttata tatagatgca cagggtgaga ttttgggcaa acatcgaaag 420 

ctggtgccaa cgggcggaga acgcctggta tgggcacaag gagatggaag cacgctgcag 480 

gtttacgata cacctctagg aaagcttggt ggtttaattt gctgggaaaa ttatatgccg 540 

ctggcacgct acgctatgta tgcctgggga actcaaatct acgtcgcggc aacgtgggat 600 

cgcggccaac cctggctctc aacgatacgg catatcgcta aagagggcag ggtatacgta 660 

atcggttgct gtatcgcgat gcgtaaagac gatattccag ataggtactc catgaagcag 720 

aagtattatg cggagatgga tgaatggatc aacgtaggtg acagcgcgat tgtcaatcct 780 
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gagggggact tcattgcggg gcctgtgagc aagcaggagg aaattctcta tgcggagatt 
gatccgcgga tggtgcaagg tccgaagtgg atgctggatg tggcggggca ttacgcgagg 
cctgatgtgt tcgagttgac ggtgcatacg gatgtgaggg agatgatgcg ggtggagcat 
gattatcaat ga 

<210> 242 
<2U> 323 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 

<400> 242 m , _ ^_ „ . 

Met He Ser Asn Glu His Asn Asn Thr Pro Phe Lys Val Ala Ala Val 

1 5 10 I 5 

Gin Ala Thr Pro Val Phe Leu Asp Arg Glu Ala Thr He Asp Lys Ala 

20 25 30 

Cys Glu Leu He Ala Thr Ala Gly His Glu Gly Ala Arg Leu He Val 

35 40 45 

Phe Pro Glu Ala Phe He Pro Ser Tyr Pro Glu Trp Val Trp Gly He 

50 55 60 

Pro Ser Gly Glu Gin Gly Leu Leu Asn Asp Leu Tyr Ala Glu Leu Leu 
65 70 75 80 

Thr Asn Ser Val Thr He Pro Ser Asn Ala Thr Asp Arg Leu Cys Arg 

85 90 95 

Ala Ala Lys Leu Ala Asn Ala Tyr Val Val Met Gly Met Ser Glu Arg 

100 105 U0 

Asn He Glu Ala Ser Gly Ala Ser Leu Tyr Asn Thr Met Leu Tyr He 

115 120 125 

Asp Ala Gin Gly Glu He Leu Gly Lys His Arg Lys Leu Val Pro Thr 

130 135 140 

Gly Gly Glu Arg Leu Val Trp Ala Gin Gly Asp Gly Ser Thr Leu Gin 
145 150 155 160 

Val Tyr Asp Thr Pro Leu Gly Lys Leu Gly Gly Leu He Cys Trp Glu 

165 170 175 

Asn Tyr Met Pro Leu Ala Arg Tyr Ala Met Tyr Ala Trp Gly Thr Gin 

180 185 190 

He Tyr Val Ala Ala Thr Trp Asp Arg Gly Gin Pro Trp Leu Ser Thr 

195 200 205 

He Arg His He Ala Lys Glu Gly Arg Val Tyr Val He Gly Cys Cys 

210 215 220 

He Ala Met Arg Lys Asp Asp He Pro Asp Arg Tyr Ser Met Lys Gin 
225 230 235 240 

Lys Tyr Tyr Ala Glu Met Asp Glu Trp He Asn Val Gly Asp Ser Ala 

245 250 255 

He Val Asn Pro Glu Gly Asp Phe He Ala Gly Pro Val Ser Lys Gin 

260 265 270 

Glu Glu He Leu Tyr Ala Glu He Asp Pro Arg Met Val Gin Gly Pro 

275 280 285 

Lys Trp Met Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe 

290 295 300 

Glu Leu Thr Val His Thr Asp Val Arg Glu Met Met Arg Val Glu His 
305 310 315 320 

Asp Tyr Gin 



840 
900 
960 
972 



<210> 243 
<211> 999 
<212> DMA 
<213> Unknown 
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60 



<220> 

<223> Obtained from an environmental sample 
<400> 243 

atgaagcaaa ctcgagtagc gatcattcag gcggaacctg tatatctcaa tttgcaagcg 

agtgttgcga gggctatcga tcttgccgga cgcgccgcga agcaaggagc gcgtctgata 120 

gtgtttggag agacctggtt gccgggttat cccgcgtggc tggattactg tcccggcatg 180 

gcgttctggg atcaccggcc gacaaaagaa gtgtttgccc ggacccgcga gaacagtgtt 240 

gtaattccgg gaaaggaaat cgaacagctc tgtaaaactg cggcggagct gggagttgta 300 

atttcgatcg gtgtaaacga aaaaattctg gaaggccccg gaaacggcac gctctacaat 360 

tctcttttgc tgattgatga atcaggaaaa ctggccggcc atcaccgcaa actggttccg 420 

acttatacgg aacggatggt gtggggaatg ggtgatggag gtggaatgga agccatatcg 480 

actgcagccg gcagggttgg cggattgatt tgctgggagc actggatgcc attgagccgg 540 

caggtgctgc acatgtcggg tgaggaaatt catgtggcag tgtggcccac ggttcatgag 600 

gtgcaccagc ttgcatcacg ccattatgca tttgaagggc gttgttttgt gctcgcagcc 660 

ggattgttga tgaaggtccg ggatattcct ccggagctgg aattgccttc tcagatgtcg 720 

cgtgaatccg aagactggct tctgcgcggc gggagcgccg tcattggtcc ggatggaaag 780 

tacattgtgg agccgttgtt tgatcgagag gcgattctca cagccgatct tgaattagcc 840 

gcatgcgatc gtgaaaaaat gacgctggac gtaacgggac attattcccg ccccgatctt 900 

tttcacctgg aattcaggaa acagcaatcc ggccatattg cgggagcagg aacgatcagc 960 

cggcaaaaat cagcgccgga ccgcgcggac gatcactaa 999 

<210> 244 
<211> 332 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 244 

Met Lys Gin Thr Arg Val Ala lie He Gin Ala Glu Pro Val Tyr Leu 

15 10 15 

Asn Leu Gin Ala Ser Val Ala Arg Ala He Asp Leu Ala Gly Arg Ala 

20 25 30 

Ala Lys Gin Gly Ala Arg Leu He Val Phe Gly Glu Thr Trp Leu Pro 

35 40 45 

Gly Tyr Pro Ala Trp Leu Asp Tyr Cys Pro Gly Met Ala Phe Trp Asp 

50 55 60 

His Arg Pro Thr Lys Glu Val Phe Ala Arg Thr Arg Glu Asn Ser Val 
65 70 75 80 

Val He Pro Gly Lys Glu He Glu Gin Leu Cys Lys Thr Ala Ala Glu 

85 90 95 

Leu Gly Val Val He Ser He Gly Val Asn Glu Lys He Leu Glu Gly 

100 105 HO 

Pro Gly Asn Gly Thr Leu Tyr Asn Ser Leu Leu Leu He Asp Glu Ser 

115 120 125 

Gly Lys Leu Ala Gly His His Arg Lys Leu Val Pro Thr Tyr Thr Glu 

130 135 140 

Arg Met Val Trp Gly Met Gly Asp Gly Gly Gly Met Glu Ala He Ser 
145 150 155 160 

Thr Ala Ala Gly Arg Val Gly Gly Leu He Cys Trp Glu His Trp Met 

165 170 175 

Pro Leu Ser Arg Gin Val Leu His Met Ser Gly Glu Glu He His Val 

180 185 190 

Ala Val Trp Pro Thr Val His Glu Val His Gin Leu Ala Ser Arg His 

195 200 205 

Tyr Ala Phe Glu Gly Arg Cys Phe Val Leu Ala Ala Gly Leu Leu Met 

"210 215 220 

Lys Val Arg Asp He Pro Pro Glu Leu Glu Leu Pro Ser Gin Met Ser 
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225 








2 JO 




235 




240 


Arg Glu 


Ser 


Glu Asp 


Trp 


Leu 


Leu Arg Gly Gly Ser Ala Val 


He Gly 






245 






250 


255 




Pro Asp 


Gly 


Lys 


Tyr 


He 


Val 


Glu Pro Leu Phe Asp Arg Glu 


Ala 


He 


260 








265 270 






Leu Thr 


Ala 


Asp 


Leu 


Glu 


Leu 


Ala Ala Cys Asp Arg Glu Lys Met 


Thr 




275 










280 285 






Leu Asp 


Val 


Thr 


Gly 


His 


Tyr 


Ser Arg Pro Asp Leu Phe His 


Leu 


Glu 


290 










295 


300 






Phe Arg 


Lys 


Gin 


Gin 


Ser 


Gly 


His lie Ala Gly Ala Gly Thr 


He 


Ser 


305 






310 




315 




320 


Arg Gin 


Lys 


Ser 


Ala 
325 


Pro 


Asp 


Arg Ala Asp Asp His 
330 







<210> 245 
<211> 999 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 245 

atgggtgaga atgccaactt caccgtcgca gctgtccagg caacgccggt cttcttagac 60 

cgggatgcga cggtcgagaa ggcttgcgag ctcatcgctg aagccgggcg aaacggagcg 120 

cgcctggccg tttttcccga ggcgtttgtg ccggcttacc cggactgggt ctgggctgtt 180 

ccgccgggcg attcaaggct gctgcacgag ctctacggtg agctgatcca gaactctgtc 240 

acgattccca gcgagtcgac ggagaaactc tgccgggccg cccgcggggc caaagtctgc 300 

gtggcgatcg gcatcaacga gaggaatgcg gaggcaagcg ggggtagcct ctacaacagc 360 

ctcctgtaca tcagcccgga cggccaggtc ctcgggaagc accgcaagct cgttcccacc 420 

ggagcggagc ggcttgtctg ggcgcagggc gacggcagca ctatcgacgt gtttgagttg 480 

cctttctgtc gtttgggtgg cctcatctgt tgggagaact acatgccgct ggcccgttat 540 

gcgatgtacg cctggggcac gcaggtctac gtcgcggcaa cgtgggacca cggcgaacct 600 

tggctctcaa ccttgaggca tatcgccagg gaggggcgtg catatgtcat tggcgtttgc 660 

atgccgatgc gcatgagcga catcccggac cgatacgagt tcaagcgcaa gtactatggc 720 

gggcgcgact ggatcaatac tggtgacagc gccatcgtgg gtccggacgg aaacttcatc 780 

gccggccccc tgagcgagcg cgaagagatc ctgtacgccg atatagacct gaatcggctt 840 

gcgaactcga agtggatgct ggacgtcgcc gggcactatg cacggccgga cgtcttccag 900 

ttgaccgtta accgcgagcc gaacccgatg atctctgagg atgggcacaa gacggttccc 960 

acgctaccga aacgtgcggg gaagagtagg acgagatga 999 

<210> 246 
<211> 332 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 246 






















Met Gly Glu 


Asn 


Ala 


Asn 


Phe 


Thr 


Val Ala Ala 


Val 


Gin 


Ala Thr 


Pro 


1 




5 








10 






15 




Val Phe Leu 


Asp 


Arg 


Asp 


Ala 


Thr 


Val Glu Lys 


Ala Cys 


Glu Leu 


lie 




20 










25 






30 




Ala Glu Ala 


Gly Arg 


Asn Gly Ala 


Arg Leu Ala 


Val 


Phe 


Pro Glu 


Ala 


35 










40 






45 






Phe Val Pro 


Ala 


Tyr 


Pro 


Asp 


Trp 


Val Trp Ala 


Val 


Pro 


Pro Gly Asp 


50 








55 






60 








Ser Arg Leu 


Leu 


His 


Glu 


Leu 


Tyr 


Gly Glu Leu 


He 


Gin 


Asn Ser 


Val 


65 






70 






75 








80 


Thr He Pro 


Ser 


Glu 


Ser 


Thr 


Glu 


Lys Leu Cys 


Arg 


Ala 


Ala Arg 


Gly 
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85 90 95 

Ala Lys Val Cys Val Ala lie Gly lie Asn Glu Arg Asn Ala Glu Ala 

100 105 110 

Ser Gly Gly Ser Leu Tyr Asn Ser Leu Leu Tyr He Ser Pro Asp Gly 

115 120 125 

Gin Val Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly Ala Glu Arg 

130 135 140 

Leu Val Trp Ala Gin Gly Asp Gly Ser Thr He Asp Val Phe Glu Leu 
145 150 155 160 

Pro Phe Cys Arg Leu Gly Gly Leu He Cys Trp Glu Asn Tyr Met Pro 

165 170 175 

Leu Ala Arg Tyr Ala Met Tyr Ala Trp Gly Thr Gin Val Tyr Val Ala 

180 185 190 

Ala- Thr Trp Asp His Gly Glu Pro Trp Leu Ser Thr Leu Arg His He 

195 200 205 

Ala Arg Glu Gly Arg Ala Tyr Val He Gly Val Cys Met Pro Met Arg 

210 215 220 

Met Ser Asp He Pro Asp Arg Tyr Glu Phe Lys Arg Lys Tyr Tyr Gly 
225 230 235 240 

Gly Arg Asp Trp He Asn Thr Gly Asp Ser Ala He Val Gly Pro Asp 

245 250 255 

Gly Asn Phe He Ala Gly Pro Leu Ser Glu Arg Glu Glu He Leu Tyr 

260 265 270 

Ala Asp He Asp Leu Asn Arg Leu Ala Asn Ser Lys Trp Met Leu Asp 

275 280 285 

Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe Gin Leu Thr Val Asn 

290 295 300 

Arg Glu Pro Asn Pro Met He Ser Glu Asp Gly His Lys Thr Val Pro 
305 310 315 320 

Thr Leu Pro Lys Arg Ala Gly Lys Ser Arg Thr Arg 
325 330 

<210> 247 
<211> 990 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 247 

atgccgaccc 

gaccgggggg 

gcccggctga 

gttccgccgg 

gtggaagtcc 

tacctggtca 

accctcctct 

acgggagggg 

acccccctgg 

tacgccatgt 

ccctggcttt 

tgcatggcta 

cccgcagccc 

ttcatcgccg 

cagatggggg 

ttcgaactga 

ccgggagaga 



ccaaggaaaa 
agacggtaaa 
tcgtgtttcc 
ggagggagaa 
cggggccggt 
tgggggtcac 
atttcggtcc 
aacggaccgt 
gaaagatcgg 
atgcgtgggg 
caaccctgcg 
tgcaaaaagg 
gggagtggat 
ggccggcggg 
ggcccaagtg 
tcgttcaccg 
aagaaacggg 



gtttagaatc 
gaaggcctgc 
ggagtccttc 
gctcctgaat 
gacggaggaa 
cgagcgggac 
ccagggaagc 
ctgggcccgg 
cggcctgatc 
cactcagatt 
gcacatcgcc 
ggacatcccg 
caacacgggc 
aaagaaggaa 
gatgctggac 
ggaggcccgg 
tgaaggatag 



gccgccgttc 
cggctggcgg 
atccccgctt 
gaaatgtacg 
ttgggccggg 
accgaggcca 
ctgctgggaa 
ggggacggca 
tgctgggaga 
tacctggccc 
aaggaagggc 
gatcgcttcg 
gacagcgcca 
gagatcctgt 
gtggccggcc 
ccgatgatcc 



aggcttgccc 
ccgaggccgg 
acccggactg 
ccgaattcct 
cggcggaaag 
gcggggcaag 
aacaccgcaa 
gcacgctgca 
actacatgcc 
ccacctggga 
gggtttacgt 
aatacaagca 
tcctgaaccc 
acgctgaaat 
attacgcccg 
gggtgacgga 



cgtttttctg 
gggccagggc 
ggtgtgggcc 
ggccggcgcg 
ggccggcgct 
cctgtacaac 
actggtgccc 
ggtgtacgat 
cctggcccgc 
ccggggggaa 
ggtggggtgc 
aaaatactat 
ggagggggaa 
agacccccgg 
gccggatgtc 
agcgccgtct 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
990 



<210> 248 
<211> 329 
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<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 248 

Met Pro Thr Pro Lys Glu Lys Phe Arg He Ala Ala Val Gin Ala Cys 

15 10 15 

Pro Val Phe Leu Asp Arg Gly Glu Thr Val Lys Lys Ala Cys Arg Leu 

20 25 30 

Ala Ala Glu Ala Gly Gly Gin Gly Ala Arg Leu He Val Phe Pro Glu 

35 40 45 

Ser Phe He Pro Ala Tyr Pro Asp Trp Val Trp Ala Val Pro Pro Gly 

50 55 60 

Arg Glu Lys Leu Leu Asn Glu Met Tyr Ala Glu Phe Leu Ala Gly Ala 
65 70 75 80 

Val Glu Val Pro Gly Pro Val Thr Glu Glu Leu Gly Arg Ala Ala Glu 

85 90 95 

Arg Ala Gly Ala Tyr Leu Val Met Gly Val Thr Glu Arg Asp Thr Glu 

100 105 110 

Ala Ser Gly Ala Ser Leu Tyr Asn Thr Leu Leu Tyr Phe Gly Pro Gin 

115 120 125 

Gly Ser Leu Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly Gly Glu 

130 135 140 

Arg Thr Val Trp Ala Arg Gly Asp Gly Ser Thr Leu Gin Val Tyr Asp 
145 150 155 160 

Thr Pro Leu Gly Lys He Gly Gly Leu He Cys Trp Glu Asn Tyr Met 

165 170 175 

Pro Leu Ala Arg Tyr Ala Met Tyr Ala Trp Gly Thr Gin He Tyr Leu 

180 185 190 

Ala Pro Thr Trp Asp Arg Gly Glu Pro Trp Leu Ser Thr Leu Arg His 

195 200 205 

He Ala Lys Glu Gly Arg Val Tyr Val Val Gly Cys Cys Met Ala Met 

210 215 220 

Gin Lys Gly Asp He Pro Asp Arg Phe Glu Tyr Lys Gin Lys Tyr Tyr 
225 230 . 235 240 

Pro Ala Ala Arg Glu Trp He Asn Thr Gly Asp Ser Ala He Leu Asn 

245 250 255 

Pro Glu Gly Glu Phe He Ala Gly Pro Ala Gly Lys Lys Glu Glu He 

260 265 270 

Leu Tyr Ala Glu He Asp Pro Arg Gin Met Gly Gly Pro Lys Trp Met 

275 280 285 

Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe Glu Leu He 

290 295 300 

Val His Arg Glu Ala Arg Pro Met He Arg Val Thr Glu Ala Pro Ser 
305 310 315 320 

Pro Gly Glu Lys Glu Thr Gly Glu Gly 
325 

<210> 249 
<2H> 1017 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 249 

atgggcattc aacatccgaa atatcgcgtc gcggtggtgc aggcggcacc ggcctggctc 60 
gacctcgagg cgtcggtcag caagagcatc gcgctgatag aggaggccgc cgccaagggc 120 
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gccaagctga tcgcgttccc cgaggccttc atccccggct atccctggta catctggctg 180 

gactcgccgg cctgggcgat cggccgcggc ttcgtgcagc gctatttcga caattcgctc 240 

agctatgaca gcccgcaggc ggagcgcctg aggctcgcag tgaagaaggc cggcatgacc 300 

gcagtgctcg gcctgtccga gcgcgacggc ggcagcctct atctcgcgca atggttgatc 360 

ggacccgacg gcgagaccat cgcaaagcgg cgcaagctgc ggccgaccca tgccgagcgc 420 

accgtctacg gcgagggcga cggcagcgac cttgcggtgc atgaccgccc cggcatcggc 480 

cggctcggtg cgctgtgctg ctgggagcat ctgcagccgc tgtcgaaata cgcgatgtac 540 

gcccagaacg agcaggtgca tgtcgcggcc tggccgagct tctcgctgta cgatccgttc 600 

gcgccggcgc tcggctggga ggtcaacaat gcggcctcgc gcgtctatgc cgtcgagggc 660 

tcctgcttcg tgctggcgcc ctgcgccacc gtctcgcagg cgatgatcga cgagctctgc 720 

gaccgcgacg acaagcatgc gctgctgcat gttggcggcg gccatgccgc gatcttcggc 780 

cccgacggca gcgcgatcgc ggacaagctt ccgtccgacc aggagggcct cctgttcgcc 840 

gacatcgatc tcggcgcgat cgggatcgcg aagaatgccg ctgatccggc cgggcactat 900 

tcgcgcccgg acgtgacgcg gctgctgctc aacaagaagc cctcgaagcg cgtcgagcac 960 

ttcgcgctgc cgctcgacac gctcgcgggc gaggagatcg acgcggccgc aagctaa 1017 

<210> 250 
<211> 338 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained front an environmental sample 
<400> 250 

Met Gly He Gin His Pro Lys Tyr Arg Val Ala Val Val Gin Ala Ala 

15 10 15 

Pro Ala Trp Leu Asp Leu Glu Ala Ser Val Ser Lys Ser He Ala Leu 

20 25 30 

He Glu Glu Ala Ala Ala Lys Gly Ala Lys Leu He Ala Phe Pro Glu 

35 40 45 

Ala Phe He Pro Gly Tyr Pro Trp Tyr He Trp Leu Asp Ser Pro Ala 

50 55 60 

Trp Ala He Gly Arg Gly Phe Val Gin Arg Tyr Phe Asp Asn Ser Leu 
65 70 75 80 

Ser Tyr Asp Ser Pro Gin Ala Glu Arg Leu Arg Leu Ala Val Lys Lys 

85 90 95 

Ala Gly Met Thr Ala Val Leu Gly Leu Ser Glu Arg Asp Gly Gly Ser 

100 105 HO 

Leu Tyr Leu Ala Gin Trp Leu He Gly Pro Asp Gly Glu Thr He Ala 

115 120 125 

Lys Arg Arg Lys Leu Arg Pro Thr His Ala Glu Arg Thr Val Tyr Gly 

130 135 140 

Glu Gly Asp Gly Ser Asp Leu Ala Val His Asp Arg Pro Gly He Gly 
145 150 155 160 

Arg Leu Gly Ala Leu Cys Cys Trp Glu His Leu Gin Pro Leu Ser Lys 

165 170 175 

Tyr Ala Met Tyr Ala Gin Asn Glu Gin Val His Val Ala Ala Trp Pro 

180 185 190 

Ser Phe Ser Leu Tyr Asp Pro Phe Ala Pro Ala Leu Gly Trp Glu Val 

195 200 205 

Asn Asn Ala Ala Ser Arg Val Tyr Ala Val Glu Gly Ser Cys Phe Val 

210 215 220 

Leu Ala Pro Cys Ala Thr Val Ser Gin Ala Met He Asp Glu Leu Cys 
225 230 235 240 

Asp Arg Asp Asp Lys His Ala Leu Leu His Val Gly Gly Gly His Ala 

245 250 255 

Ala He Phe Gly Pro Asp Gly Ser Ala He Ala Asp Lys Leu Pro Ser 

260 265 270 

Asp Gin Glu Gly Leu Leu Phe Ala Asp He Asp Leu Gly Ala He Gly 
275 280 285 
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lie Ala Lys Asn Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp 

290 295 300 

Val Thr Arg Leu Leu Leu Asn Lys Lys Pro Ser Lys Arg Val Glu His 
305 310 315 320 

Phe Ala Leu Pro Leu Asp Thr Leu Ala Gly Glu Glu He Asp Ala Ala 
325 330 335 

Ala Ser 



<210> 251 
<211> 978 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 251 

gtgaccatcg tgagagctgc cgccgtgcag atcagtcccg tgctctacag ccgggaagcc 60 

accgtagaaa aagtcgttcg caagatccgc gaactgggaa gaaacggcgt gcagttcgcc 120 

accttcccgg aaaccctggt gccctactac ccgtacttcg cggccgtgca gacgggcatc 180 

gaactgctgt ccggcaagga gcacctgcga ctgctggaac aatccgtaac ggttccctcg 240 

cccgccaccg atgccattgc ccaggcggca cgcgaagccg gcatggtggt gtccatcggt 300 

gtcaacgagc gcgacggagg caccatctac aacacgcagc tgctgtttga cgccgacggc 360 

acgctggtac agcgccgccg caagatcacg ccgacgcatt tcgagcgcat ggtctggggc 420 

cagggcgacg gctcgggcct gcgagccgtg gacaccaagg ccggccgcat cggtcagctc 480 

gcctgcttcg agcacaacaa cccgctggcg cgctacgcca tgatcgccga cggtgagcag 540 

atccattcgg ccatgtaccc gggctctgcc ttcggcgagg gcttcgcgca gcgcatggaa 600 

atcaacatac gccagcacgc cctggagtct ggctgcttcg tggtgaatgc gaccgcgtgg 660 

ctggatgccg accagcaggc gcagatcatg aaggatacgg gctgcggcat cggcccgatc 720 

tccggcggct gcttcacgac catcgtcacg ccggacggca tgctgatcgg tgaacccctc 780 

cgcgaaggcg aaggcgaagt catcgccgac ctcgatttca ccctgatcga ccggcgcaag 840 

ctgctggtgg actcggtggg ccactacaac cgtccggagc tgctgagcct gctgatcgat 900 

cgcacccctg cggcgaactt ccatgagcgc aatgcgcttc cgtccgtcaa caccgccagc 960 

agcctcgaaa tcgtctga 9*78 

<210> 252 
<211> 325 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 252 



Val Thr 


He 


Val 


Arg 


Ala 


Ala 


Ala 


Val 


Gin 


He Ser Pro Val Leu Tyr 


1 






5 










10 


15 


Ser Arg 


Glu 


Ala 


Thr 


Val 


Glu 


Lys 


Val 


Val 


Arg Lys He Arg Glu Leu 






20 










25 




30 


Gly Arg 


Asn 


Gly 


Val 


Gin 


Phe 


Ala 


Thr 


Phe 


Pro Glu Thr Leu Val Pro 


35 








40 






45 


Tyr Tyr 


Pro 


Tyr 


Phe 


Ala 


Ala 


Val 


Gin 


Thr 


Gly He Glu Leu Leu Ser 


50 








55 








60 


Gly Lys 


Glu 


His 


Leu 


Arg 


Leu 


Leu 


Glu 


Gin 


Ser Val Thr Val Pro Ser 


65 








70 










75 80 


Pro Ala 


Thr 


Asp 


Ala 


He 


Ala 


Gin 


Ala 


Ala 


Arg Glu Ala Gly Met Val 






85 










90 


95 


Val Ser 


He 


Gly 


Val 


Asn 


Glu 


Arg 


Asp 


Gly 


Gly Thr He Tyr Asn Thr 






100 










105 




110 


Gin 'Leu 


Leu 


Phe 


Asp 


Ala 


Asp 


Gly 


Thr 


Leu 


Val Gin Arg Arg Arg Lys 




115 






120 






125 
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He Thr Pro Thr His Phe Glu 
130 135 
Ser Gly Leu Arg Ala Val Asp 
145 150 
Ala Cys Phe Glu His Asn Asn 
165 

Asp Gly Glu Gin He His Ser 
180 

Glu Gly Phe Ala Gin Arg Met 
195 

Glu Ser Gly Cys Phe Val Val 
210 215 
Gin Gin Ala Gin He Met Lys 
225 230 
Ser Gly Gly Cys Phe Thr Thr 
245 

Gly Glu Pro Leu Arg Glu Gly 
260 

Phe Thr Leu He Asp Arg Arg 
275 

Tyr Asn Arg Pro Glu Leu Leu 
290 295 
Ala Asn Phe His Glu Arg Asn 
305 310 
Ser Leu Glu He Val 
325 



Arg Met Val Trp Gly Gin Gly Asp Gly 
140 

Thr Lys Ala Gly Arg He Gly Gin Leu 
155 160 
Pro Leu Ala Arg Tyr Ala Met He Ala 

170 175 
Ala Met Tyr Pro Gly Ser Ala Phe Gly 

185 190 
Glu He Asn He Arg Gin His Ala Leu 
200 205 
Asn Ala Thr Ala Trp Leu Asp Ala Asp 
220 

Asp Thr Gly Cys Gly He Gly Pro He 
235 240 
He Val Thr Pro Asp Gly Met Leu He 

250 255 
Glu Gly Glu Val He Ala Asp Leu Asp 

265 270 
Lys Leu Leu Val Asp Ser Val Gly His 
280 285 
Ser Leu Leu I.le Asp Arg Thr Pro Ala 
300 

Ala Leu Pro Ser Val Asn Thr Ala Ser 
315 320 



<210> 253 
<211> 924 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 253 

atgtcaaacg 

ttcctcgatc 

gaaggggcga 

tgggcaatcc 

aattcggtca 

aatgcctacg 

tataacacgc 

gtgccaactg 

tacgatactc 

gcccgctaca 

gggcaaccct 

ggctgttgta 

ttttacgcgg 

gggcagttta 

ccgcgcatgg 

gacgtattcc 



agaaccacaa 
gtgaagcgac 
ggctggttgt 
caccgggcga 
cgattcccag 
tagtgatggg 
tattgtacat 
gcggcgagcg 
cactgggtaa 
ccatgtacgc 
ggctctccac 
tcgcgatgcg 
aggccgatga 
tcgcggggcc 
tgcaaggccc 
agttgacagt 



ccaaacattc 
catcgacaaa 
cttcccggag 
agaaggcgtg 
tgacgtgacg 
gatgagcgaa 
cgatgcgaag 
actggtgtgg 
actcggcggt 
ctggggcaca 
cttgcggcat 
caaagatgat 
gtggatcaat 
ggtacgcaaa 
gaagtggatg 
gtaa 



aaagttgccg 
gcttgcgagt 
gcattcatcc 
ctcaatgagt 
gatagactgt 
tgcaatgccg 
ggtgaaatcc 
gcacagggcg 
ttaatttgct 
caaatctata 
atcgccaaag 
atcccagagc 
ataggcgaca 
caggaagaga 
ctcgacgtgg 



cggtgcaggc 
tgattgctgc 
cgtcctatcc 
tgtacgcgga 
gccgggccgc 
aggccagtgg 
tgggtaaaca 
atggcagcac 
gggagaatta 
tcgcagcgac 
aaggcagggt 
gttacccaat 
gcgcgatcgt 
ttctctacgc 
cagggcacta 



cacacctgta 
agccggcaat 
agattgggta 
actgctctcc 
gaggcttgcc 
cgcaagcctg 
tcgaaagttg 
gctgcaggtc 
tatgccgctg 
atgggatcgc 
gtacgtgatc 
gaagcagaag 
caatcctgaa 
ggagattaat 
cgccaggccg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
924 



<210> 254 
<211> 307 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 
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<<100> 254 

Met Ser Asn Glu Asn His Asn Gin Thr Phe Lys Val Ala Ala Val Gin 

15 10 15 

Ala Thr Pro Val Phe Leu Asp Arg Glu Ala Thr He Asp Lys Ala Cys 

20 25 30 

Glu Leu He Ala Ala Ala Gly Asn Glu Gly Ala Arg Leu Val Val Phe 

35 40 45 

Pro Glu Ala Phe He Pro Ser Tyr Pro Asp Trp Val Trp Ala He Pro 

50 55 €0 

Pro Gly Glu Glu Gly Val Leu Asn Glu Leu Tyr Ala Glu Leu Leu Ser 
65 70 75 80 

Asn Ser Val Thr He Pro Ser Asp Val Thr Asp Arg Leu Cys Arg Ala 

85 90 95 

Ala Arg Leu Ala Asn Ala Tyr Val Val Met Gly Met Ser Glu Cys Asn 

100 105 HO 

Ala Glu Ala Ser Gly Ala Ser Leu Tyr Asn Thr Leu Leu Tyr He Asp 

115 120 125 

Ala Lys Gly Glu He Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly 

130 135 140 

Gly Glu Arg Leu Val Trp Ala Gin Gly Asp Gly Ser Thr Leu Gin Val 
145 150 155 160 

Tyr Asp Thr Pro Leu Gly Lys Leu Gly Gly Leu He Cys Trp Glu Asn 

165 170 175 

Tyr Met Pro Leu Ala Arg Tyr Thr Met Tyr Ala Trp Gly Thr Gin He 

180 185 190 

Tyr He Ala Ala Thr Trp Asp Arg Gly Gin Pro Trp Leu Ser Thr Leu 

195 200 205 

Arg His He Ala Lys Glu Gly Arg Val Tyr Val He Gly Cys Cys He 

210 215 220 

Ala Met Arg Lys Asp Asp He Pro Glu Arg Tyr Pro Met Lys Gin Lys 
225 230 235 240 

Phe Tyr Ala Glu Ala Asp Glu Trp He Asn He Gly Asp Ser Ala He 

245 250 255 

Val Asn Pro Glu Gly Gin Phe He Ala Gly Pro Val Arg Lys Gin Glu 

260 265 270 

Glu He Leu Tyr Ala Glu He Asn Pro Arg Met Val Gin Gly Pro Lys 

. 275 280 285 

Trp Met Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe Gin 

290 295 300 

Leu Thr Val 
305 

<210> 255 
<211> 1005 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 255 

atgacaatgt ctaagaccaa atttagggtt gcagctgtgc aggcggcgcc ggttttcctt 60 

gatcgggaag cgacgttgga taaagcttgt ggattgattg aggaggcggg ccgcaacggc 120 

gccagcctcg tcgtcttccc tgagtcattc attccggcct accccgattg ggtttgggct 180 

gtgccggcgg gcgaagaagc tttactcaat gaactgtacg cacaactgtt ggccaacgcc 240 

gttgaaattc ccggcccggc cactcaacgt ttgagccagg cggctaaaaa ggctaaggtt 300 

cacctggcta tgggcctgac cgaacgcaac agcgaggcca gcggcggcag cctttacaac 360 

accttgctct atcttgaccc gcagggccac attctgggca agcatcgtaa gctggtgccc 420 

accggcggtg agcggctggt ttgggcccag ggcgacggca gcactttgca agtttacgat 480 

acgcctctgg gtaaactcag cggcctgatt tgctgggaaa attatatgcc gctggcgcgc 54 0 
tacgcgctgt atgcctgggg tacgcaaatt tatattgcgg ccacctggga tcggggtgag 



600 
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ccgtggcttt cgacgttgcg gcatattgcc aaagagggcc gggtgttggt catcggttgc 660 

ggtatggcct tgcgcaaggc tgatattcct gatcattttg aattcaagca gcgcttttat 720 

caaaacgccg ccgagtggat caacgggggc gacagcgcca ttgtcaaccc tgatggtgaa 780 

tttattgctg gccccttaag cgagcaggaa ggcattttgt acgccgagat tgatccggcc 840 

cagatgggcg ggccaaagtg gatgctcgac gtggccgggc attacgctcg cccggatgtg 900 

tttgaactga cggtccatac cgccgcccga cccatgatca cctcgaaaaa ggatggccta 960 

acacccgccg aggccgttac gcaagtaacg aaagcattat tgtaa 1005 

<210> 256 
<211> 334 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 256 

Met Thr Met Ser Lys Thr Lys Phe Arg Val Ala Ala Val Gin Ala Ala 

15 10 15 

Pro Val Phe Leu Asp Arg Glu Ala Thr Leu Asp Lys Ala Cys Gly Leu 

20 25 30 

lie Glu Glu Ala Gly Arg Asn Gly Ala Ser Leu Val Val Phe Pro Glu 

35 40 45 

Ser Phe lie Pro Ala Tyr Pro Asp Trp Val Trp Ala Val Pro Ala Gly 

50 55 60 

Glu Glu Ala Leu Leu Asn Glu Leu Tyr Ala Gin Leu Leu Ala Asn Ala 
65 70 75 80 

Val Glu lie Pro Gly Pro Ala Thr Gin Arg Leu Ser Gin Ala Ala Lys 

85 90 95 

Lys Ala Lys Val His Leu Ala Met Gly Leu Thr Glu Arg Asn Ser Glu 

100 105 110 

Ala Ser Gly Gly Ser Leu Tyr Asn Thr Leu Leu Tyr Leu Asp Pro Gin 

115 120 125 

Gly His He Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly Gly Glu 

130 135 140 

Arg Leu Val Trp Ala Gin Gly Asp Gly Ser Thr Leu Gin Val Tyr Asp 
145 150 . 155 160 

Thr Pro Leu Gly Lys Leu Ser Gly Leu lie Cys Trp Glu Asn Tyr Met 

165 170 175 

Pro Leu Ala Arg Tyr Ala Leu Tyr Ala Trp Gly Thr Gin He Tyr He 

180 185 190 

Ala Ala Thr Trp Asp Arg Gly Glu Pro Trp Leu Ser Thr Leu Arg His 

195 200 205 

He Ala Lys Glu Gly Arg Val Leu Val He Gly Cys Gly Met Ala Leu 

210 215 220 

Arg Lys Ala Asp He Pro Asp His Phe Glu Phe Lys Gin Arg Phe Tyr 
225 230 235 240 

Gin Asn Ala Ala Glu Trp He Asn Gly Gly Asp Ser Ala He Val Asn 

„ 245 250 255 

Pro Asp Gly Glu Phe He Ala Gly Pro Leu Ser Glu Gin Glu Gly He 

260 265 270 

Leu Tyr Ala Glu He Asp Pro Ala Gin Met Gly Gly Pro Lys Trp Met 

275 280 285 

Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe Glu Leu Thr 

290 295 300 

Val His Thr Ala Ala Arg Pro Met He Thr Ser Lys Lys Asp Gly Leu 
305 310 315 320 

Thr Pro Ala Glu Ala Val Thr Gin Val Thr Lys Ala Leu Leu 
325 330 

<210> 257 
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<211> 942 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 257 

atgaacaaga tcgcgatcat tcagcgccca cccgtgctac tcgaccgcat cgccacgctg 60 

gcggttgcgg tggagtcgat cgacgaagct gccgcagccg gtgcctcact gatcgttctt 120 

ccagaaacct tcatccccgg ctacccgtcc tggatctggc gtctcgcgcc ggggagggat 180 

ggtgcgctca ttgcccagtt gcatgcccga ctgctcgcca acgcggtcga tcttgcggct 240 

ggagatctgg atgccctgtg tgaagttgcg cacggccacc gggtgaccgt ggtgtgcggc 300 

ctcaacgaat gcgagcgcag tcgcggcggg ggcactctct acaacacggt cgtcgtgatc 360 

gaccccgacg gcaagctgtg caatcgccac cgcaagctga tgccgaccaa cccggaacgc 420 

atggtgcacg gtctgggtga tgcatcgggc ctgcgcgccg tcgacacccc ggtgggtcga 480 

gtgggcgcac tcatctgctg ggaaaactat atgccgctgg cacgctacgc actttacgcc 540 

gagggggtgg aagtctacgt ggcgcccacc tatgacagcg gcgatggctg gatcagtacg 600 

atgcgtcata ttgcgcttga gggacgctgc tgggtgctgg gtagcggaac cgtactgcgt 660 

ggcagcgacg tcccagaaga ctttccgtca cacctggacc tgtttcccga cgcggaggaa 720 

tggatcaatc cgggcgactc ggtggtcgtc gatcctcagg gcaagatcgt cgcaggcccg 780 

atgcgacgtg agacaggcat tctctacgca gaaatcgacg ccgaacgggt cgcgccttcg 840 

cgccgcacgc tcgatgtcgc cggacactac gcccgcccgg atattttcga gctccatgtc 900 

cgacgtacgc cggcgatgcc ggtccacgcc gttgatgcat ga 942 

<210> 258 

<211> 313 

<212> PRT 

<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 258 

Met Asn Lys lie Ala lie lie Gin Arg Pro Pro Val Leu Leu Asp Arg 

1.5 10 15 

lie Ala Thr Leu Ala Val Ala Val Glu Ser lie Asp Glu Ala Ala Ala 

20 25 30 

Ala Gly Ala Ser Leu lie Val Leu Pro Glu Thr Phe He Pro Gly Tyr 

35 40 45 

Pro Ser Trp He Trp Arg Leu Ala Pro Gly Arg Asp Gly Ala Leu He 

50 55 60 

Ala Gin Leu His Ala Arg Leu Leu Ala Asn Ala Val Asp Leu Ala Ala 
65 70 75 80 

Glv Asp Leu Asp Ala Leu Cys Glu Val Ala His Gly His Arg Val Thr 

y 85 90 * 95 

Val Val Cys Gly Leu Asn Glu Cys Glu Arg Ser Arg Gly Gly Gly Thr 

100 105 HO 

Leu Tyr Asn Thr Val Val Val He Asp Pro Asp Gly Lys Leu Cys Asn 

115 120 125 

Arg His Arg Lys Leu Met Pro Thr Asn Pro Glu Arg Met Val His Gly 

130 135 140 

Leu Gly Asp Ala Ser Gly Leu Arg Ala Val Asp Thr Pro Val Gly Arg 
145 150 155 160 

Val Gly Ala Leu He Cys Trp Glu Asn Tyr Met Pro Leu Ala Arg Tyr 

165 170 175 

Ala Leu Tyr Ala Glu Gly Val Glu Val Tyr Val Ala Pro Thr Tyr Asp 

180 185 190 

Ser Gly Asp Gly Trp He Ser Thr Met Arg His He Ala Leu Glu Gly 

195 200 205 

Arg Cys Trp Val Leu Gly Ser Gly Thr Val Leu Arg Gly Ser Asp Val 
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210 






215 


220 






Pro 


Glu 


Asp 


Phe Pro Ser 


His 


Leu Asp Leu Phe Pro 


Asp Ala Glu 


Glu 


225 




230 




235 




240 


Trp 


lie 


Asn 


Pro Gly Asp 


Ser 


Val Val Val Asp Pro 


Gin Gly Lys 


He 






245 




250 


255 




Val 


Ala 


Gly 


Pro Met Arg 


Arg 


Glu Thr Gly He Leu 


Tyr Ala Glu 


He 








260 




265 


270 




Asp Ala 


Glu 


Arg Val Ala 


Pro 


Ser Arg Arg Thr Leu 


Asp Val Ala 


Gly 






275 






280 


285 




His 


Tyr 


Ala 


Arg Pro Asp 


He 


Phe Glu Leu His Val 


Arg Arg Thr 


Pro 




290 






295 


300 






Ala 


Met 


Pro 


Val His Ala 


Val 


Asp Ala 







305 310 

<210> 259 
<211> 981 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 259 

atggccatca tcaaagccgc cgccgtccag atcagtccgg tgctgtatag ccgcgaggga 60 

accgtggaca aggtctgcca gcagatcatc gccctcggtc agcaaggcgt gcaatttgcg 120 

gtctttccgg aaacggtggt gccgtactac ccctatttct ccttcgtcca gccgccgttc 180 

gccatgggca aggaacacct gaaactgttg gaacaatcgg tgattgtgcc gtcggctgcc 240 

accttggcga tcggcgaagc gtgcaaacaa gcgggaatgg tggtgtctat cggcgtcaat 300 

gagcgcgatg gcggcacgat ctacaacgcc cagttgctgt ttgacgctga tggcagcttg 360 

attcagcacc gtcgcaaaat aaccccgacg taccacgagc ggatgatctg gggtcaaggc 420 

gacggctccg ggttgcgcgc catcgacagc gcggtcgggc gtattggctc gctggcctgc 480 

tgggaacatt acaacccctt ggcccgctac gccctgatgg ccgacggcga gcagattcac 540 

gcggctatgt ttcccggctc tctggtgggt gacatttttg ccgatcagat agaggtcact 600 

attcgtcatc acgccttgga gtccggctgc ttcgtggtca actccaccgc gtggcttgat 660 

gctgatcagc aaggccaaat catgcaggac accggttgca gcattggccc aatctcgggt 720 

ggctgcttca cggccatcgt ttccccggaa ggcaaattac tcggcgaacc gctgcgttca 780 

ggtgagggcg cagtcatcgc cgacctggac atggcattga tcgacaagcg caaacggatg 840 

atggattccg tcggccatta cagccgccca gaactgctca gtttattgat cgaccgcacg 900 

cccaccgctc atgtgcatga gcgcggcgcc catcaccttg ccgtagcctc tatcggggag 960 

cttgaccatg caaaccaatg a 981 

<210> 260 
<211> 326 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 260 



Met Ala 


He 


He 


Lys 


Ala 


Ala 


Ala 


Val Gin He Ser Pro Val Leu 


Tyr 


1 






5 








10 15 




Ser Arg 


Glu 


Gly 


Thr 


Val 


Asp 


Lys 


Val Cys Gin Gin He He Ala 


Leu 




20 










25 30 




Gly Gin 


Gin 


Gly 


Val 


Gin 


Phe 


Ala 


Val Phe Pro Glu Thr Val Val 


Pro 


35 










40 


45 




Tyr Tyr 


Pro 


Tyr 


Phe 


Ser 


Phe 


Val 


Gin Pro Pro Phe Ala Met Gly Lys 


50 










55 




60 




Glu His 


Leu 


Lys 


Leu 


Leu 


Glu 


Gin 


Ser Val He Val Pro Ser Ala 


Ala 


65 ' 






70 






75 


80 


Thr Leu 


Ala 


lie 


Gly 


Glu 


Ala 


Cys 


Lys Gin Ala Gly Met Val Val 


Ser 
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85 90 95 

lie Gly Val Asn Glu Arg Asp Gly Gly Thr He Tyr Asn Ala Gin Leu 

100 105 HO 

Leu Phe Asp Ala Asp Gly Ser Leu He Gin His Arg Arg Lys He Thr 

115 120 125 

Pro Thr Tyr His Glu Arg Met He Trp Gly Gin Gly Asp Gly Ser Gly 

130 135 140 

Leu Arg Ala He Asp Ser Ala Val Gly Arg He Gly Ser Leu Ala Cys 
145 150 155 160 

Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Leu Met Ala Asp Gly 

165 170 175 

Glu Gin He His Ala Ala Met Phe Pro Gly Ser Leu Val Gly Asp He 

180 185 190 

Phe Ala Asp Gin He Glu Val Thr He Arg His His Ala Leu Glu Ser 

195 200 205 

Gly Cys Phe Val Val Asn Ser Thr Ala Trp Leu Asp Ala Asp Gin Gin 

210 215 220 

Gly Gin He Met Gin Asp Thr Gly Cys Ser He Gly Pro He Ser Gly 
225 230 235 240 

Gly Cys Phe Thr Ala He Val Ser Pro Glu Gly Lys Leu Leu Gly Glu 

245 250 • '255 

Pro Leu Arg Ser Gly Glu Gly Ala Val He Ala Asp Leu Asp Met Ala 

260 265 270 

Leu He Asp Lys Arg Lys Arg Met Met Asp Ser Val Gly His Tyr Ser 

275 280 285 

Arg Pro Glu Leu Leu Ser Leu Leu He Asp Arg Thr Pro Thr Ala His 

290 295 300 

Val His Glu Arg Gly Ala His His Leu Ala Val Ala Ser He Gly Glu 
305 310 315 320 

Leu Asp His Ala Asn Gin 
325 

<210> 261 
<211> 1014 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 261 

atgggtctgg ttcatcagaa atacaaggta gcggtggttc aggcggcacc ggtttttctc 60 

gatctcgatg caaccgtgga caagaccatt gcgctgatcg aggaagcttc cgcgcaaggc 120 

gcaaaactgg tcgcgtttcc cgagaccttc attcccggat atccgtggca gatctggctc 180 

ggcgcgccgg cctgggcgat cggccgcggc tttgtgcagc gctacttcga caattcgctg 240 

ggcttcgaca gcccgcaggc ggaaaaaatc cgccaggccg tgaagcgcgc caagctgacc 300 

gcggtgcttg ggctatccga acgcgacggc ggcagcctct atatcgcgca gtggctgatc 360 

ggccctgacg gcgagaccat cgccaagcgc cggaaactgc gtccgaccca tgccgaacgc 420 

accgtgttcg gcgaaggcga cggcagcgat cttgcggtcc acgatcgcgc cgacgtcggg 480 

cgcttgggcg cactgtgctg ctgggagcac ctgcagccgc tatcgaaata cgcgatgtac 540 

gcccagaacg aacaggtgca tgtcggcgcc tggccgagct tttcgctgta cgatccgttc 600 

gcacatgcgc tcggccatga ggtcaacaac gccgccagca aggtctatgc ggtcgaaggc 660 

tcctgcttct tcctgggtcc ctgcgctgtc gtctcgcagg cgatgatcga cgagctctgc 720 

gactctcccg agaaacatgc cttcctgcat gtcggcggcg gccacgcggt gatctacggc 78.0 

ccggacggca gttcgctggc cgagaaactc ccgcccgacc aggaaggcat tctgtacgcc 840 

gatatcgatc tcggcatgat cggggtcgcg aagaacgccg ccgatccggc cgggcattat 900 

tcgcggcccg acgtcacgcg gctgttgctc aacaccacgc gcgccaaccg cgtcgagcat 960 

ttctcgcttc ccgtcgatgc cgaggtcatg agcgaaatca ggctgcaggc ctga 1014 



<210> 262 
<211> 337 
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<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 262 

Met Gly Leu Val His Gin Lys Tyr Lys Val Ala Val Val Gin Ala Ala 

15 10 15 

Pro Val Phe Leu Asp Leu Asp Ala Thr Val Asp Lys Thr He Ala Leu 

20 25 30 

He Glu Glu Ala Ser Ala Gin Gly Ala Lys Leu Val Ala Phe Pro Glu 

35 40 45 

Thr Phe He Pro Gly Tyr Pro Trp Gin He Trp Leu Gly Ala Pro Ala 

50 55 60 

Trp Ala He Gly Arg Gly Phe Val Gin Arg Tyr Phe Asp Asn Ser Leu 
65 70 75 80 

Gly Phe Asp Ser Pro Gin Ala Glu Lys He Arg Gin Ala Val Lys Arg 

85 90 95 

Ala Lys Leu Thr Ala Val Leu Gly Leu Ser Glu Arg Asp Gly Gly Ser 

100 105 HO 

Leu Tyr He Ala Gin Trp Leu He Gly Pro Asp Gly Glu Thr He Ala 

115 120 125 

Lys Arg Arg Lys Leu Arg Pro Thr His Ala Glu Arg Thr Val Phe Gly 

130 135 140 

Glu Gly Asp Gly Ser Asp Leu Ala Val His Asp Arg Ala Asp Val Gly 
145 150 155 160 

Arg Leu Gly Ala Leu Cys Cys Trp Glu His Leu Gin Pro Leu Ser Lys 

165 170 175 

Tyr Ala Met Tyr Ala Gin Asn Glu Gin Val His Val Gly Ala Trp Pro 

180 185 190 

Ser Phe Ser Leu Tyr Asp Pro Phe Ala His Ala Leu Gly His Glu Val 

195 200 205 

Asn Asn Ala Ala Ser Lys Val Tyr Ala Val Glu Gly Ser Cys Phe Phe 

210 215 220 

Leu Gly Pro Cys Ala Val Val Ser Gin Ala Met He Asp Glu Leu Cys 
225 230 235 240 

Asp Ser Pro Glu Lys His Ala Phe Leu His Val Gly Gly Gly His Ala 

245 250 255 

Val He Tyr Gly Pro Asp Gly Ser Ser Leu Ala Glu Lys Leu Pro Pro 

260 265 270 

Asp Gin Glu Gly He Leu Tyr Ala Asp He Asp Leu Gly Met He Gly 

275 280 285 

Val Ala Lys Asn Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp 

290 295 300 

Val Thr Arg Leu Leu Leu Asn Thr Thr Arg Ala Asn Arg Val Glu His 
305 310 315 320 

Phe Ser Leu Pro Val Asp Ala Glu Val Met Ser Glu He Arg Leu Gin 
325 330 335 

Ala 



<210> 263 
<211> 1014 
<212> DNA 
<213> Unknown 



<220> 
<223> 



Obtained from an environmental sample 



<400> 



263 
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atgaaagtcg 
accgtcgcga 
accttcccgg 
cagcttctgg 
cccgcgaccg 
gtcaacgagc 
accctgatcc 
cagggtgacg 
gcatgcttcg 
atccattcgg 
atcaatatcc 
ctggatgccg 
tcgggcggtt 
cgctcgggcg 
atgctgatgg 
cgcacgccga 
ccgcgctccg 



tcaaagcggc 
aggtcgtgcg 
aaaccgttgt 
ctggaaccga 
acgcgatcgg 
gtgatggtgg 
agcgccgccg 
ggtcgggcct 
agcacaacaa 
ccatgtatcc 
gccagcacgc 
accagcaggc 
gcttcaccac 
agggcgaggt 
actcggcagg 
ccgcaaacgt 
gcgatctgcc 



agcggttcag 
gaagatccac 
gccgtactac 
gcatctgaaa 
cgaggcagcc 
aaccctgtac 
caagatcacg 
gcgtgccgtc 
tcctctggcg 
gggttctgcc 
actggagtcc 
gcaaatcatg 
gatcgtggca 
catcgccgat 
ccactacaac 
gcacgaacgg 
cgctgcgctg 



ttgagccctg 
gagcttgggc 
ccgtatttct 
ttgctcgacc 
cggaaggcgg 
aacacccaat 
cccacccatt 
gacagcaagg 
cgctacgcga 
ttcggcgagg 
gggtgcttcg 
aaagacacgg 
cccgacggca 
ctcgatttca 
cgtccggaac 
atggcgcatc 
gctggcgcgc 



tcctctacag 
agcagggcgt 
ccgcggtcca 
aggccgtgac 
gcatggtggt 
tgctcttcga 
tcgagcgcat 
tcggccgcat 
tgatggccga 
gctttgccca 
tcgtgaacgc 
gctgcgggat 
cgctgctggg 
cggagatcga 
tgctcagtct 
cccaagcgag 
aggagatcct 



ccgcgaggca 
gcagttcgcc 
gacgccgatg 
ggtgccgtct 
gtccatcggc 
cgccgatggc 
gatctggggc 
tggccagctg 
cggcgagcag 
gaggatggaa 
gacggcctgg 
cggtccgatc 
ggaacctctg 
ccggcgcaag 
gctgatcgac 
cacgaagcag 
gtga 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1014 



<210> 264 
<211> 337 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 
<400> 264 

Met Lys Val Val Lys Ala Ala Ala Val Gin Leu Ser Pro Val Leu Tyr 

15 10 15 

Ser Arg Glu Ala Thr Val Ala Lys Val Val Arg Lys lie His Glu Leu 

20 25 30 

Gly Gin Gin Gly Val Gin Phe Ala Thr Phe Pro Glu Thr Val Val Pro 

35 40 45 

Tyr Tyr Pro Tyr Phe Ser Ala Val Gin Thr Pro Met Gin Leu Leu Ala 

50 55 60 

Gly Thr Glu His Leu Lys Leu Leu Asp Gin Ala Val Thr Val Pro Ser 
65 *70 . 75 80 

Pro Ala Thr Asp Ala He Gly Glu Ala Ala Arg Lys Ala Gly Met Val 

85 90 95 

Val Ser He Gly Val Asn Glu Arg Asp Gly Gly Thr Leu Tyr Asn Thr 

100 105 110 

Gin Leu Leu Phe Asp Ala Asp Gly Thr Leu He Gin Arg Arg Arg Lys 

115 120 125 

He Thr Pro Thr His Phe Glu Arg Met He Trp Gly Gin Gly Asp Gly 

130 135 140 

Ser Gly Leu Arg Ala Val Asp Ser Lys Val Gly Arg He Gly Gin Leu 
145 150 155 160 

Ala Cys Phe Glu His Asn Asn Pro Leu Ala Arg Tyr Ala Met Met Ala 

165 170 175 

Asp Gly Glu Gin He His Ser Ala Met Tyr Pro Gly Ser Ala Phe Gly 

180 185 190 

Glu Gly Phe Ala Gin Arg Met Glu He Asn He Arg Gin His Ala Leu 

195 200 205 

Glu Ser Gly Cys Phe Val Val Asn Ala Thr Ala Trp Leu Asp Ala Asp 

210 215 220 

Gin Gin Ala Gin He Met Lys Asp Thr Gly Cys Gly He Gly Pro He 
225 230 235 240 

Ser Gly Gly Cys Phe Thr Thr He Val Ala Pro Asp Gly Thr Leu Leu 

245 250 255 

Gly Glu Pro Leu Arg Ser Gly Glu Gly Glu Val He Ala Asp Leu Asp 
260 265 270 
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Phe Thr Glu He Asp Arg Arg Lys 
275 280 
Tyr Asn Arg Pro Glu Leu Leu Ser 

290 295 
Ala Asn Val His Glu Arg Met Ala 
305 310 
Pro Arg Ser Gly Asp Leu Pro Ala 
325 

Leu 



Met Leu Met Asp Ser Ala Gly His 
285 

Leu Leu He Asp Arg Thr Pro Thr 
300 

His Pro Gin Ala Ser Thr Lys Gin 
315 320 
Ala Leu Ala Gly Ala Gin Glu He 
330 335 



35 



50 

Thr Val 
65 



Leu 


Gly 
5 


He 


Val 


Gin 


Met 


Asn 
10 


Ala 


Glu 


Pro Leu 


Asn 
15 


Val 


Leu 


Leu 


Lys 


Ala 


Glu 


Arg 


Tyr 


Val 


Ala 


Lys Cys 


Ala 


Ala 


20 








25 








30 




Phe 


Gin 


Leu 


Val 


Val 


Leu 
40 


Pro 


Glu 


Met 


Phe 


Asn Val 
45 


Gly 


Glu 


Ser 


Leu 


Met 


Met 


Val 


Ala 


Glu Pro Leu Asp Gly Lys 








55 










60 






He 
80 


Trp 


Leu 


Gin 
70 


Arg 


Gin 


Ala 


Ser 


Thr 
75 


His 


Asn He 


Tyr 


Leu 


Tyr 


Glu Arg Tyr Asp 


Glu 


His 


Phe 


Tyr Asn 


Thr 


Met 




85 










90 








95 


Asn 


Gly 


Tyr 


Asp 


Gly 


Ser 


Val 


Gin 


Tyr 


Tyr 


Arg Lys 


Arg 


100 






105 








110 







<210> 265 
<211> 999 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 

<400> 265 ^ „ 

atgcttaatc taggcatagt ccagatgaac gcagagccgc tcaacgtgga aggcaacctg bu 

ctcaaggcgg agcgctatgt cgcgaagtgc gccgcggacg gcgcccaact cgtggtgctg 120 

ccggagatgt tcaacgtcgg cttccacctc ggcgagtccc tgatgatggt cgccgagccc 180 

ctggacggca agaccgtgca gtggctgcaa cggcaggcgt ccacccataa catatatatc 240 

accgggagct tatacgagcg ttacgacgag catttctaca acaccatggt catggtggga 300 

tacgacggca gcgtgcagta ctaccgcaag cgcaatccta cctggtccga gtcggcggtg 360 

tggcgccgca gcgaggtgcc aggccccggt atattcgata ccccgttcgg gcgcatcggg 420 

ggcgtcatct gcttcgattc cttcgcgcgc gagacccacg agggcttcaa gcagagcggg 480 

gtcgaggcgg tggtaatcat cgccctgtgg ggcgccaacc gtgcgcgggc attcttctgg 540 

cgcccggacc tcctgctaag ccgggaaggg ctggtccgtt ggtcccggct ggcctcggag 600 

gacgtccccc gaaatcacgc gaaagagctc ggggtcccgg tcgccttcgt caaccagagt 660 

ggcaccatcc gcatgaccag ccccatccct ttccccgact ggccggtgca gagctccttc 720 

tacgacttca tcggcaagtc ccacgtccgg gacgcatccg gagaggtgat cgcgagggtg 780 

gacgaggggg agatcgactc ctgcctggtg gtcccggtag aggtcgagca ggcgcagagc 840 
aggccggaga tcaggaagtc aaatatatcg cccggctacc tcggcaagga ttactatttc 
gtggagccgc cgcttatctg caagctcttc caggtctggt tcctcagcgg cctggtgccc 
accgaatacg aggcgcggcg tctgcgccac ctgttctga 

<210> 266 
<211> 332 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 

<400> 266 
Met Leu As 
1 



900 
960 
999 
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Pro Thr Trp Ser Glu Ser Ala Val Trp Arg Arg Ser Glu Val Pro Gly 

115 120 125 

Pro Gly He Phe Asp Thr Pro Phe Gly Arg He Gly Gly Val He Cys 

130 135 140 

Phe Asp Ser Phe Ala Arg Glu Thr His Glu Gly Phe Lys Gin Ser Gly 
145 150 155 160 

Val Glu Ala Val Val He He Ala Leu Trp Gly Ala Asn Arg Ala Arg 

165 170 175 

Ala Phe Phe Trp Arg Pro Asp Leu Leu Leu Ser Arg Glu Gly Leu Val 

180 185 190 

Arg Trp Ser Arg Leu Ala Ser Glu Asp Val Pro Arg Asn His Ala Lys 

195 200 205 

Glu Leu Gly Val Pro Val Ala Phe Val Asn Gin Ser Gly Thr He Arg 

210 215 220 

Met Thr Ser Pro He Pro Phe Pro Asp Trp Pro Val Gin Ser Ser Phe 
225 230 235 240 

Tyr Asp Phe He Gly Lys Ser His Val Arg Asp Ala Ser Gly Glu Val 

245 250 255 

He Ala Arg Val Asp Glu Gly Glu He Asp Ser Cys Leu Val Val Pro 

260 265 270 

Val Glu Val Glu Gin Ala Gin Ser Arg Pro Glu He Arg Lys Ser Asn 

275 280 285 

He Ser Pro Gly Tyr Leu Gly Lys Asp Tyr Tyr Phe Val Glu Pro Pro 

290 295 300 

Leu He Cys Lys Leu Phe Gin Val Trp Phe Leu Ser Gly Leu Val Pro 
305 310 315 320 

Thr Glu Tyr Glu Ala Arg Arg Leu Arg His Leu Phe 
325 330 

<210> 267 
<211> 1038 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 267 

atgggcatta cgcatccgaa gttcaaggcc gcggcggtac aggcggcgcc gggctttctc 60 

gacagcgagg ccaccgtcga caagacgatc cgcctgatgc aggaagcggc ggaccacggc 120 

gcctcgctga tcgtctttcc ggaagcctgg ctgcccggtt atccgtggtg gatctggctc 180 

ggtccgcccg cctggggcat gcagttcgtg cagcgctact tcgacaattc gccgagcgtc 240 

ggcgatgatc ttttccgccg gatcgagcgc gcggccgcca aggcgaagat cgaagtggtc 300 

ctcggtctca gcgagcgcgc tgccggctcg ctgtacctcg cgcaggcgtt catctcctca 360 

acgggcgaga cgcgcgcagt gcgccgcaag ttgcgaccaa cgcacgtcga gcgaaccgtt 420 

ttcggcgagg gcgatggcag cgacttcaag gtgttcgaca ctccgctggg ccgcgtcggt 480 

ggtctcttgt gctgggaaca cctgcaaccg ctgtcgcgct acgcgatgtt ctcgatgaac 540 

gagcaggtgc acgccgccgc ctggccgacg ttcagcctct acacggattt tgcccatgcc 600 

ctcggccacg aactgaatct cgcagccagc gctacttacg cggctgaagg gcagtgctac 660 

gtgattgccg cctgtggcgt ggtcacgcag gagatgctgg atctgatgaa ggcgccgtgc 720 

cccccggaat atctgcgggt cggcggcgga tacgccatga tctttgcgcc cgacggacgg 780 

cgcattgcgg cggcgctgcc gccggaacaa gaagggctga tttacgccga catcgatctt 840 

tcgatgatct ctctcgccaa ggcggctgcc gatcccaccg gtcactactc gcggccggat 900 

gtcgtgcggc ttatgctgaa taccgaaccg atgcagcggg tcgaaaagct gcagccgccg 960 

ctggactcag ccgcgcgccg tgagaatgaa ccggcacgcg agaccgcagc ggcgaccgag 1020 

agccgccagc cccagtaa 1038 

<210> 268 
<211> 344 
<212> PRT 
<213> Unknown 
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<220> 

<223> Obtained from an environmental sample 



<400> 268 

Met Gly lie Thr His Pro Lys Phe Lys Ala Ala Ala Val Gin Ala Ala 

15 10 15 

Pro Gly Phe Leu Asp Ser Glu Ala Thr Val Asp Lys Thr He Arg Leu 

20 25 30 

Met Gin Glu Ala Ala Asp His Gly Ala Ser Leu He Val Phe Pro Glu 

35 40 45 

Ala Trp Leu Pro Gly Tyr Pro Trp Trp He Trp Leu Gly Pro Pro Ala 

50 55 60 

Trp Gly Met Gin Phe Val Gin Arg Tyr Phe Asp Asn Ser Pro Ser Val 
65 70 75 80 

Gly Asp Asp Leu Phe Arg Arg He Glu Arg Ala Ala Ala Lys Ala Lys 

85 90 95 

He Glu Val Val Leu Gly Leu Ser Glu Arg Ala Ala Gly Ser Leu Tyr 

100 105 HO 

Leu Ala Gin Ala Phe He Ser Ser Thr Gly Glu Thr Arg Ala Val Arg 

115 120 125 

Arg Lys Leu Arg Pro Thr His Val Glu Arg Thr Val Phe Gly Glu Gly 

130 135 140 

Asp Gly Ser Asp Phe Lys Val Phe Asp Thr Pro Leu Gly Arg Val Gly 
145 150 155 160 

Gly Leu Leu Cys Trp Glu His Leu Gin Pro Leu Ser Arg Tyr Ala Met 

165 170 175 

Phe Ser Met Asn Glu Gin Val His Ala Ala Ala Trp Pro Thr Phe Ser 

180 185 190 

Leu Tyr Thr Asp Phe Ala His Ala Leu Gly His Glu Leu Asn Leu Ala 

195 200 205 

Ala Ser Ala Thr Tyr Ala Ala Glu Gly Gin Cys Tyr Val He Ala Ala 

210 215 220 

Cys Gly Val Val Thr Gin Glu Met Leu Asp Leu Met Lys Ala Pro Cys 
225 230 235 240 

Pro Pro Glu Tyr Leu Arg Val Gly Gly Gly Tyr Ala Met He Phe Ala 

245 250 255 

Pro Asp Gly Arg Arg He Ala Ala Ala Leu Pro Pro Glu Gin Glu Gly 

260 265 270 

Leu He Tyr Ala Asp He Asp Leu Ser Met He Ser Leu Ala Lys Ala 

275 280 285 

Ala Ala Asp Pro Thr Gly His Tyr Ser Arg Pro Asp Val Val Arg Leu 

290 295 300 

Met Leu Asn Thr Glu Pro Met Gin Arg Val Glu Lys Leu Gin Pro Pro 
305 310 315 320 

Leu Asp Ser Ala Ala Arg Arg Glu Asn Glu Pro Ala Arg Glu Thr Ala 

325 330 335 

Ala Ala Thr Glu Ser Arg Gin Pro 
340 



<210> 269 
<2U> 1014 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 
<400> 269 

atgtctgaaa cagccttcaa gatcgcggtc gtacaggcgg ctccggtttt tctcgacgca 
aaggcgacgg tggacaaggc gatcggtctg atggccgaag ccggcgccaa gggcgccaag 
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ctgctcgcat tcccggaagt attcatcccc ggctaccctt ggtggctgtg gctgggcaca 180 

ccggcatggg gcatgcagtt tgttgccaag tatcacgcga actcgcttcg tgcagacggg 24 0 

cctgaattgg cagccctcgc ggcggcggcg gcgaagtccg atatcaatgc cgtcatcggc 300 

ttctcggaga tcgacggcgg ttccctctac atcagccagg cgctcatcag cgacaagggc 360 

gagataatgt tcaaacggcg caagctgaag ccgacgcacg tcgaacgcac gttgttcggc 420 

gaaggggacg ggtccgactt ccaggtcgtg gacacgagcg tcggcaggct cggtgccttg 4 80 

tgttgcgccg aacacataca gccgctgtcg aagtacgcga tgtactccat gcacgaacag 540 

gtgcacgtcg cctcctggcc gtcatttact ttgtaccgcg gcacggcata tgccttgggc 600 

cacgaggtca atctggccgc gagccagatt tatgcgctcg agggaggctg tttcgtcctt 660 

catgcgagcg ccatcaccgg ccaggacatg tttgacgtgc tgtgcgacac tccggagagg 720 

acgcaactgc tgaactccga cggcggcaag gtcggcggcg gctactcgat gatcttcggt 7 80 

cccgatggcc agccccttgt tgggcatctg cctcaagaca ccgagggaat actctacgca 840 

gatattgacc tggcgaacat ttccgttgcc aaagcggcct acgacccgtc cggacactat 900 

gcgcgcggag acgtggtgcg cttgatggtc aatcgcaacc cgcggcatac gagtgttgcg 960 

ttcggcgggg gcgccggcga ggcagcaacc tggacggaag caaaagcgga gtga 1014 

<210> 270 
<211> 337 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 270 

Met Ser Glu Thr Ala Phe Lys He Ala Val Val Gin Ala Ala Pro Val 

I 5 10 15 

Phe Leu Asp Ala Lys Ala Thr Val Asp Lys Ala He Gly Leu Met Ala 

20 25 30 

Glu Ala Gly Ala Lys Gly Ala Lys Leu Leu Ala Phe Pro Glu Val Phe 

35 40 45 

He Pro Gly Tyr Pro Trp Trp Leu Trp Leu Gly Thr Pro Ala Trp Gly 

50 55 60 

Met Gin Phe Val Ala Lys Tyr His Ala Asn Ser Leu Arg Ala Asp Gly 
65 70 75 80 

Pro Glu Leu Ala Ala Leu Ala Ala Ala Ala Ala Lys Ser Asp He Asn 

85 90 95 

Ala Val He Gly Phe Ser Glu He Asp Gly Gly Ser Leu Tyr He Ser 

100 105 HO 

Gin Ala Leu He Ser Asp Lys Gly Glu He Met Phe Lys Arg Arg Lys 

115 120 125 

Leu Lys Pro Thr His Val Glu Arg Thr Leu Phe Gly Glu Gly Asp Gly 

130 135 140 

Ser Asp Phe Gin Val Val Asp Thr Ser Val Gly Arg Leu Gly Ala Leu 
145 150 155 160 

Cys Cys Ala Glu His He Gin Pro Leu Ser Lys Tyr Ala Met Tyr Ser 

165 170 175 

Met His Glu Gin Val His Val Ala Ser Trp Pro Ser Phe Thr Leu Tyr 

180 185 190 

Arg Gly Thr Ala Tyr Ala Leu Gly His Glu Val Asn Leu Ala Ala Ser 

195 200 205 

Gin He Tyr Ala Leu Glu Gly Gly Cys Phe Val Leu His Ala Ser Ala 

210 215 220 

He Thr Gly Gin Asp Met Phe Asp Val Leu Cys Asp Thr Pro Glu Arg 
225 230 235 240 

Thr Gin Leu Leu Asn Ser Asp Gly Gly Lys Val Gly Gly Gly Tyr Ser 

245 250 255 

Met He Phe Gly Pro Asp Gly Gin Pro Leu Val Gly His Leu Pro Gin 

260 265 270 

Asp Thr Glu Gly He Leu Tyr Ala Asp He Asp Leu Ala Asn He Ser 
275 280 285 
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Val Ala Lys Ala Ala Tyr Asp Pro Ser Gly His Tyr Ala Arg Gly Asp 

290 295 300 

Val Val Arg Leu Met Val Asn Arg Asn Pro Arg His Thr Ser Val Ala 
305 310 315 320 

Phe Gly Gly Gly Ala Gly Glu Ala Ala Thr Trp Thr Glu Ala Lys Ala 
325 330 335 

Glu 



<210> 271 
<211> 966 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 271 

atgtccagcg agaataacaa cgctacattc aaagttgccg cagttcaggc cacacctgtg 60 

tatcttgatc gtgaagcaac catcgacaag gcttgcgagt tgatcgctac tgctggcagc 120 

gaaggagctc gcctgattat ctttccagaa gcgttcatcc caacctatcc tgagtgggta 180 

tggggtattc cttctggtga gcaaggttta ctcaacgagc tctattcaga gttgctcacc 240 

aattcggtca cgattcccag cgacgcgact gacagactgt gcgaggccgc gaagcttgct 300 

aatgcctacg tggtgatggg aatgagtgaa cggaatgtcg aagcgagtgg tgcaagcctg 360 

tataatacgc tcttgtacat agatgcgcag ggggagattt tagggaaaca tcgaaagttg 420 

gtaccaacgg gcggtgagcg cctggtatgg gcgcaaggtg atggcagcac gctgcaggtc 4 80 

tacgatactc cattgggaaa actcggtggt ttaatttgct gggaaaatta tatgccactg 540 

gcacgctacg ctatgtatgc ctggggtaca caaatctatg tcgcagcaac gtgggatcgc 600 

ggccaaccct ggctctcaac gttacggcat attgccaaag aaggcagggt atacgtaatt 660 

ggttgctgta ttgcgatgcg taaagatgat attccagatc gttactccat gaagcagaag 720 

tattacgctg aaatggagga atggattaat attggtgaca gcgcgattgt caatcccgaa 780 

•ggacacttta ttgcagggcc tgtgcgcaag caagaagaaa ttctttacgc ggagatcgat 840 

ccacgcatgg tgcaaggccc gaagtggatg ctcgatgtgg ctgggcacta tgcgagacca 900 

gatgtgttcc agttgacggt gcatacggat gtgaggcaga tgattcgggt ggaacatgat 960 

tcataa 966 



<210> 272 
<211> 321 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 272 
















Lys Val Ala Ala 


Val 


Gin 


Met 


Ser 


Ser 


Glu 


Asn 


Asn 


Asn 


Ala 


Thr 


Phe 


1 








5 










10 




15 




Ala 


Thr 


Pro 


Val 


Tyr 


Leu 


Asp 


Arg 


Glu 


Ala 


Thr He Asp Lys 


Ala 


Cys 






20 










25 




30 






Glu 


Leu 


He 


Ala 


Thr 


Ala 


Gly 


Ser 


Glu Gly 


Ala Arg Leu He 


He 


Phe 




35 










40 






45 


He 




Pro 


Glu 


Ala 


Phe 


He 


Pro 


Thr 


Tyr 


Pro 


Glu 


Trp Val Trp Gly 


Pro 




50 










55 








60 




Thr 


Ser 


Gly 


Glu 


Gin 


Gly 


Leu 


Leu 


Asn 


Glu 


Leu 


Tyr Ser Glu Leu 


Leu 


65 








70 










75 




80 


Asn 


Ser 


Val 


Thr 


He 


Pro 


Ser 


Asp 


Ala 


Thr 


Asp Arg Leu Cys 


Glu 


Ala 








85 










90 




95 




Ala 


Lys 


Leu 


Ala 


Asn 


Ala 


Tyr 


Val 


Val 


Met 


Gly Met Ser Glu 


Arg 


Asn 






100 










105 




110 


He 


Asp 


Val 


Glu 


Ala 
115 


Ser 


Gly 


Ala 


Ser 


Leu 
120 


Tyr 


Asn 


Thr Leu Leu Tyr 
125 
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Ala Gin Gly Glu lie Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly 

130 135 140 

Gly Glu Arg Leu Val Trp Ala Gin Gly Asp Gly Ser Thr Leu Gin Val 
145 150 155 160 

Tyr Asp Thr Pro Leu Gly Lys Leu Gly Gly Leu lie Cys Trp Glu Asn 

165 170 175 

Tyr Met Pro Leu Ala Arg Tyr Ala Met Tyr Ala Trp Gly Thr Gin He 

180 185 190 

Tyr Val Ala Ala Thr Trp Asp Arg Gly Gin Pro Trp Leu Ser Thr Leu 

195 200 205 

Arg His He Ala Lys Glu Gly Arg Val Tyr Val He Gly Cys Cys He 

210 215 220 

Ala Met Arg Lys Asp Asp He Pro Asp Arg Tyr Ser Met Lys Gin Lys 
225 230 235 240 

Tyr Tyr Ala Glu Met Glu Glu Trp He Asn He Gly Asp Ser Ala He 

245 250 255 

Val Asn Pro Glu Gly His Phe He Ala Gly Pro Val Arg Lys Gin Glu 

260 265 270 

Glu He Leu Tyr Ala Glu He Asp Pro Arg Met Val Gin Gly Pro Lys 

275 280 285 

Trp Met Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe Gin 

290 295 300 

Leu Thr Val His Thr Asp Val Arg Gin Met He Arg Val Glu His Asp 
305 310 315 320 

Ser 



<210> 273 
<211> 1023 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 273 

atgaataact caccaccaac catccgcgct gctgccattc agctcagtcc agttctgttt 
agtcgggatg ggactaccga gaaggtgttg caggcgatcg ctagtgctgc caaggaaggg 120 
gcacaactgg ttgtttttcc agaaactttc atcccctact acccctattt ctcattcatt 180 
caaccccctg tgttgatggg caaagagcac atgcggctct atgaggaagc cgtgacggtc 240 
ccgggtccgg tgacagatgc ggtcagtcga gcagcccgtt cttacggcat ggtggtagtg 
ctgggggtga atgagcgaga tggtggctca atttacaata cacagttgat tttcgatgct 
gacggcacat tgttgctgaa gcgacgcaaa atcaccccta cctatcatga gcgcatggtc 420 
tgggggcagg gagacggtgc tggattgaag gtattggata cagcagtcgg taaggtgggt 480 
gcgctggcat gttgggaaca ttacaatccc ctggcacgat ttgcgctgat ggcacagcat 540 
gagcagattc actgcgctca gttccccggt tctctggtgg gacaaatttt cactgatcag 600 
attgaggtaa cgattcggca tcatgcgttg gaatcgggtt gttttgtggt gaatgctact 660 
ggctggctct ctccagaaca ggtggcacaa atcaccacgg atgaaaagtt gcaacgggtg 720 
ctgagtggcg ggtgtaatac cgccattatt ggacctgaag gcaatcatct ctgtcctccc 780 
attaccgatg gtgagggcat agcgatcgcc gatctcgact tctcactaat caccaaacgc 840 
aaacgcatga tggattgcgt cggtcactac tcccgccctg acttgttgaa gctgcaactc 900 
aatgcaacgg catggtcggt gctggctggg gagcaggggg caggtgccag ggagcagggg 960 
ctaggtgtgc cggatgccat gctgtctacg cctaagccag aatactcaac actggatcag 1020 
tag 

<210> 274 
<211> 340 
<212> PRT 
<213> Unknown 

<220> 
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<223> Obtained from an environmental sample 
<400> 274 

Met Asn Asn Ser Pro Pro Thr lie Arg Ala Ala Ala lie Gin Leu Ser 

1 5 10 15 

Pro Val Leu Phe Ser Arg Asp Gly Thr Thr Glu Lys Val Leu Gin Ala 

20 25 30 

lie Ala Ser Ala Ala Lys Glu Gly Ala Gin Leu Val Val Phe Pro Glu 

35 40 45 

Thr Phe He Pro Tyr Tyr Pro Tyr Phe Ser Phe He Gin Pro Pro Val 

50 55 60 

Leu Met Gly Lys Glu His Met Arg Leu Tyr Glu Glu Ala Val Thr Val 
65 70 75 80 

Pro Gly Pro Val Thr Asp Ala Val Ser Arg Ala Ala Arg Ser Tyr Gly 

85 90 95 

Met Val Val Val Leu Gly Val Asn Glu Arg Asp Gly Gly Ser He Tyr 

100 105 HO 

Asn Thr Gin Leu He Phe Asp Ala Asp Gly Thr Leu Leu Leu Lys Arg 

115 120 125 

Arg Lys He Thr Pro Thr Tyr His Glu Arg Met Val Trp Gly Gin Gly 

130 135 140 

Asp Gly Ala Gly Leu Lys Val Leu Asp Thr Ala Val Gly Lys Val Gly 
145 150 155 160 

Ala Leu Ala Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Phe Ala Leu 

165 170 175 

Met Ala Gin His Glu Gin He His Cys Ala Gin Phe Pro Gly Ser Leu 

180 185 190 

Val Gly Gin He Phe Thr Asp Gin He Glu Val Thr He Arg His His 

195 200 205 

Ala Leu Glu Ser Gly Cys Phe Val Val Asn Ala Thr Gly Trp Leu Ser 

210 215 220 

Pro Glu Gin Val Ala Gin He Thr Thr Asp Glu Lys Leu Gin Arg Val 
225 230 235 240 

Leu Ser Gly Gly Cys Asn Thr Ala He He Gly Pro Glu Gly Asn His 

245 250 255 

Leu Cys Pro Pro He Thr Asp Gly Glu Gly He Ala He Ala Asp Leu 

260 265 270 

Asp Phe Ser Leu He Thr Lys Arg Lys Arg Met Met Asp Cys Val Gly 

275 280 285 

His Tyr. Ser Arg Pro Asp Leu Leu Lys Leu Gin Leu Asn Ala Thr Ala 

290 295 300 

Trp Ser Val Leu Ala Gly Glu Gin Gly Ala Gly Ala Arg Glu Gin Gly 
305 310 315 320 

Leu Gly Val Pro Asp Ala Met Leu Ser Thr Pro Lys Pro Glu Tyr Ser 
325 330 335 

Thr Leu Asp Gin 
340 

<210> 275 
<211> 849 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 275 

atggagacgg ctcacaaagc aaaggtcgat ttccttgtgc tgggtgagac gtggctctca 60 

ggctacccgg cttggctgga ccactgcccc gatgttggcc ggtgggatta tgaaccgatg 120 

aaaaaagtgt atttgagatt tcgacaaagt gctatttctg ttcctggcaa agaatttgat 180 

ttccttactg gcctctgtaa aaaatattca caaacgcttg ccatcggtgt taatgagaaa 2 40 
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300 
360 



840 
849 



gtagatcatg gggtaggtaa tggtaccatt tataattcat ttctactgat tgattctgat 
ggaacactgt tgaatcatca tcgcaagtta gttcccactt ttactgagaa attattatac 
ggccatggag atggccatgg gctgaagtcg atggatactt cggtgggaag aatcggaggg 420 
agcatttgtt gggaacattg gatgccacta tgcagacaag cacttcatga tgcaggtgag 480 
caaatccatg ttgccctttg gccgactgtt catgacatcc atcaagtggc aagtagaagc 540 
tatgcatttg aagggcgctg ctttgtattg gctgccgggc agatttttgc tgctaaagat 600 
tttccaaagg aacttgtctt accagactat ctaaagcaaa atccggatca gctcattttg 660 
aatgggggga gctgcgtgat cggccctgat gggaaatatt tgattgagcc cgtgtttgat 720 
cgggaagaac tgattgtgtg tgaacttgac cttgacgaag cttataaaga aagaatgacg 780 
atggacgttt caggtcacta ccaaagacga gacgttttca gttttgacgt gaaccaacat 
cgacattga 

<210> 276 

<211> 310 

<212> PRT 

<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 276 

Met Thr Lys Leu Lys He Ala He Gly Gin Phe Ser Ser Asn His Leu 

I 5 10 15 

Asp Leu Lys Cys Ser Leu Glu Lys Leu Glu Lys He Met Glu Thr Ala 

20 25 30 

His Lys Ala Lys Val Asp Phe Leu Val Leu Gly Glu Thr Trp Leu Ser 

35 40 45 

Gly Tyr Pro Ala Trp Leu Asp His Cys Pro Asp Val Gly Arg Trp Asp 

50 55 60 

Tyr Glu Pro Met Lys Lys Val Tyr Leu Arg Phe Arg Gin Ser Ala He 
65 70 75 80 

Ser Val Pro Gly Lys Glu Phe Asp Phe Leu Thr Gly Leu Cys Lys Lys 

85 90 95 

Tyr Ser Gin Thr Leu Ala He Gly Val Asn Glu Lys Val Asp His Gly 

100 105 HO 

Val Gly Asn Gly Thr He Tyr Asn Ser Phe Leu Leu He Asp Ser Asp 

115 120 125 

Gly Thr Leu Leu Asn His His Arg Lys Leu Val Pro Thr Phe Thr Glu 

130 135 140 

Lys Leu Leu Tyr Gly His Gly Asp Gly His Gly Leu Lys Ser Met Asp 
145 150 155 160 

Thr Ser Val Gly Arg He Gly Gly Ser He Cys Trp Glu His Trp Met 

165 170 175 

Pro Leu Cys Arg Gin Ala Leu His Asp Ala Gly Glu Gin He His Val 

180 185 190 

Ala Leu Trp Pro Thr Val His Asp He His Gin Val Ala Ser Arg Ser 

195 200 205 

Tyr Ala Phe Glu Gly Arg Cys Phe Val Leu Ala Ala Gly Gin He Phe 

210 215 220 

Ala Ala Lys Asp Phe Pro Lys Glu Leu Val Leu Pro Asp Tyr Leu Lys 
225 230 235 240 

Gin Asn Pro Asp Gin Leu He Leu Asn Gly Gly Ser Cys Val He Gly 

245 250 255 

Pro Asp Gly Lys Tyr Leu He Glu Pro Val Phe Asp Arg Glu Glu Leu 

260 265 270 

He Val Cys Glu Leu Asp Leu Asp Glu Ala Tyr Lys Glu Arg Met Thr 

275 ' 280 285 

Met Asp Val Ser Gly His Tyr Gin Arg Arg Asp Val Phe Ser Phe Asp 

290 295 300 

Val Asn Gin His Arg His 
305 310 
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<210> 277 
<211> 1056 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 277 

atgccaaccc ccagcgatca tttcaaaatc gccgctgttc aggcctcgcc cgtgtttctg 60 

gaccgggagg ccactgtgga aaaggcctgc cggttgatcg ccgaagccgc aaagcagggc 120 

gcccgcctca tcgtctttcc ggaatctttc atcccgacct acccggactg ggtatgggcc 180 

gttcccccgg gaagggaaag aatcctgaac cagctgtatt ctgaattcct ggccaatgcc 240 

gtcgatgttc ccggcgcggc gaccgaacaa cttgcccagg ctgcacgaat ggccggcgcc 300 

tatgtgatta tgggcgtcac cgaaagagac acctcggcca gcggggccag cctctacaac 360 

accctgctct acttcagccc cgaaggcatc ctaatgggca aacaccggaa gctggttccc 420 

acggggggcg aacggctggt ctgggcctac ggagacggca gcacgctgga ggtctacgac 480 

actccgctgg gaaagatcgg cgggctgatc tgctgggaga actacatgcc cctggcccgg 540 

tacacgatgt acgcctgggg cacccagatt tacatcgccg ccacctggga ccgcggggaa 600 

ccgtggctct ccaccctgcg gcatatcgcc aaggaaggaa gggtctacgt catcgggtgc 660 

tgcatcgccc tgcgccaggg ggatatcccg gaccggttcg agtacaaggg aaaattttat 720 

tccgggtccc gggagtggat caatgagggc gacagcgcca tcgtgaaccc ggacggggaa 780 

ttcatcgccg ggccggtgcg gatgaaggag gagatcctgt atgccgagat agacccccgg 840 

cagatgcggg gccccaagtg gatgctcgat gtggccggtc attacgcccg gccggatatc 900 
ttcgagctca tcgtccaccg gaatccccac ccgatgatca aaatcgccga agacaggggc ' 960 

gcggggatcg cctcaagttt gattcgcccc cgccctaacc ttcccccatc aagggggagg 1020 

aaatcggcaa gaagcaaacg caagcccaaa aaatga 1056 

<210> 278 
<211> 351 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 278 

Met Pro Thr Pro Ser Asp His Phe Lys lie Ala Ala Val Gin Ala Ser 

15 10 15 

Pro Val Phe Leu Asp Arg Glu Ala Thr Val Glu Lys Ala Cys Arg Leu 

20 25 30 

lie Ala Glu Ala Ala Lys Gin Gly Ala Arg Leu lie Val Phe Pro Glu 

35 40 45 

Ser Phe He Pro Thr Tyr Pro Asp Trp Val Trp Ala Val Pro Pro Gly 

50 55 60 

Arg Glu Arg He Leu Asn Gin Leu Tyr Ser Glu Phe Leu Ala Asn Ala 
65 70 75 80 

Val Asp Val Pro Gly Ala Ala Thr Glu Gin Leu Ala Gin Ala Ala Arg 
85 90 95 

* Met Ala Gly Ala Tyr Val He Met Gly Val Thr Glu Arg Asp Thr Ser 
100 105 HO 

Ala Ser Gly Ala Ser Leu Tyr Asn Thr Leu Leu Tyr Phe Ser Pro Glu 

115 120 125 

Gly He Leu Met Gly Lys His Arg Lys Leu Val Pro Thr Gly Gly Glu 

130 135 140 

Arg Leu Val Trp Ala Tyr Gly Asp Gly Ser Thr Leu Glu Val Tyr Asp 
145 150 155 160 

Thr Pro Leu Gly Lys He Gly Gly Leu He Cys Trp Glu Asn Tyr Met 

165 170 175 

Pro Leu Ala Arg Tyr Thr Met Tyr Ala Trp Gly Thr Gin He Tyr He 
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180 



Ala 


Ala 


Thr 


Trp Asp Arg Gly Glu 






195 




200 


lie 


Ala 


Lys 


Glu Gly Arg Val 


Tyr 




210 




215 




Arg 


Gin 


Gly 


Asp He Pro Asp Arg 


225 






230 




Ser Gly 


Ser 


Arg Glu Trp He 


Asn 








245 




Pro 


Asp 


Gly 


Glu Phe He .Ala 


Gly 








260 




Leu 


Tyr 


Ala 


Glu He Asp Pro 


Arg 




275 




280 


Leu 


Asp 


Val 


Ala Gly His Tyr 


Ala 




290 




295 




Val 


His 


Arg 


Asn Pro His Pro 


Met 


305 






310 




Ala Gly 


He 


Ala Ser Ser Leu 


He 








325 




Ser 


Arg 


Gly 


Arg Lys Ser Ala 


Arg 



340 



185 






190 




Pro 


Trp 


Leu Ser 


Thr Leu Arg 


His 








205 




Val 


He 


Gly Cys 


Cys He Ala 


Leu 






220 






Phe 


Glu 


Tyr Lys 


Gly Lys Phe 


Tyr 






235 




240 


Glu 


Gly 


Asp Ser 


Ala He Val 


Asn 




250 




255 




Pro 


Val 


Arg Met 


Lys Glu Glu 


He 


265 






270 




Gin 


Met 


Arg Gly 


Pro Lys Trp 


Met 








285 




Arg 


Pro 


Asp He 


Phe Glu Leu 


He 






300 






He 


Lys 


He Ala 


Glu Asp Arg 


Gly 




315 




320 


Arg 


Pro 


Arg Pro 


Asn Leu Pro 


Pro 




330 




335 




Ser 


Lys 


Arg Lys 


Pro Lys Lys 




345 






350 





<210> 279 
<211> 1014 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 279 

atgggcatta cccatcccaa atacaaagtc gctgcagtcc aggccgcgcc ggtctggctg 60 

gacctggatg ccacggtgga caagtgcatc cgcctgatcc aggaggccgc tgacaagggc 120 

tgcaagctga tcgcgtttcc ggagacgttc attcccggct atccctggca catctggatg 180 

ggtgcgccgg cctgggccat cggccggggc tttgtgcagc ggtatttcga caactccctg 240 

tcctatgaca gcccgcaggc cgaaaagctg cgccaggccg tcaaggccgc aggcatcacc 300 

gcgtcgctgg gactgtcgga gcgctcgggc ggcagtctct acatcgcgca gtggctcatt 360 

ggccccgacg gcgaaacgat ctcgcagcgg cgtaagctgc ggcccacgca ttccgaacgc 420 

accgtcttcg gcgacggcga tgggagcgac ctcaaggtgc acgacacgcc gctgggccgt 480 

gtgggtgagc tggcgtgctg ggagaacatc ctgtcgctga acaagtacgc catgttctcg 540 

cagcacgagc aggtgcacat tgcagcctgg cctagcttct ccacctacga gccctttgca 600 

catgccctgg gctgggaagt gaacaacgca gtcagcaagg tgtacgcagt ggaaggcggg 660 

tgtttcgtgg tggcaccctg cgccatcatt tccaaggaga tggtcgatga actgtgcgac 720 

accccggaca agcacaccct gacccatgtg ggaggcggcc acgcggtgat ctacgggccg 780 

gatggcgccc ccctggcgga caagctgcct gaagacgccg agggcctgct gatcgcggag 840 

atcgacctcg ggatgattgg cgtggccaag aacgccatgg acccggtagg gcactactcg 900 

cgccccgacg tgcaccgcct gcttctcaac cgcaagaagg cagcgcaggt ggagcacttt 960 

gcgctgccgg tggacgccat cgattccgag ccccaagcca ccacggcgca ctga 1014 

<210> 280 
<211> 337 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 280 

Met Gly He Thr His Pro Lys Tyr Lys Val Ala Ala Val Gin Ala Ala 

15 10 * 5 

Pro Val Trp Leu Asp Leu Asp Ala Thr Val Asp Lys Cys He Arg Leu 
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He Gin Glu 
35 

Thr Phe He 
50 

Trp Ala He 
65 

Ser Tyr Asp 

Ala Gly He 

Leu Tyr He 
115 

Gin Arg Arg 

130 
Asp Gly Asp 
145 

Val Gly Glu 

Ala Met Phe 

Phe Ser Thr 
195 

Asn Ala Val 

210 
Ala Pro Cys 
225 

Thr Pro Asp 
He Tyr Gly 



Ala Glu Gly 
275 

Ala Lys Asn 

290 
His Arg Leu 
305 

Ala Leu Pro 
His 



20 

Ala Ala 

Pro Gly 

Gly Arg 

Ser Pro 
85 

Thr Ala 
100 

Ala Gin 

Lys Leu 

Gly Ser 

Leu Ala 
165 
Ser Gin 
180 

Tyr Glu 

Ser Lys 

Ala He 

Lys His 
' 245 
Pro Asp 
260 

Leu Leu 



Asp Lys 



Ala Met 
Leu Leu 



Val Asp 
325 



Tyr Pro 

55 
Gly Phe 
70 

Gin Ala 

Ser Leu 

Trp Leu 

Arg Pro 
135 
Asp Leu 
150 

Cys Trp 

His Glu 

Pro Phe 

Val Tyr 
215 
He Ser 
230 

Thr Leu 

Gly Ala 

He Ala 

Asp Pro 
295 
Asn Arg 
310 

Ala He 



25 

Gly Cys Lys 
40 

Trp His He 



30 

Phe Pro Glu 



Leu He Ala 
45 

Trp Met Gly Ala Pro Ala 
60 

Val Gin Arg Tyr Phe Asp 
75 

Arg Gin Ala 



Glu Lys Leu 
90 

Gly Leu Ser 

105 
He Gly Pro 
120 

Thr His Ser 



Asn Ser Leu 
80 

Val Lys Ala 
95 

Glu Arg Ser Gly Gly Ser 
110 

Thr He Ser 



Asp Gly Glu 
125 

Glu Arg Thr Val Phe Gly 
140 

Lys Val His Asp Thr Pro 
155 

Leu Ser Leu 



Glu Asn He 
170 

Gin Val His 

185 
Ala His Ala 
200 

Ala Val Glu 



He Ala Ala 



Leu Gly Arg 
160 

Asn Lys Tyr 

175 
Trp Pro Ser 
190 

Glu Val Asn 



Leu Gly Trp 
205 

Gly Gly Cys Phe Val Val 
220 

Lys Glu Met Val Asp Glu 
235 

Gly Gly Gly 



Thr His Val 
250 

Pro Leu Ala 

265 
Glu He Asp 
280 

Val Gly His 

Lys Lys Ala 

Asp Ser Glu 
330 



Leu Cys Asp 
240 

His Ala Val 
255 

Asp Lys Leu Pro Glu Asp 
270 

He Gly Val 



Leu Gly Met 
285 

Tyr Ser Arg Pro Asp Val 
300 

Ala Gin Val Glu His Phe 
315 320 
Pro Gin Ala Thr Thr Ala 
335 



<210> 281 
<211> 936 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 281 

atgtccaaga tcgccgtcgt ccaagagcct ccggtgctgc tcgatcgcgc cgccaccctc 
gagcgcgccg tctcggccat cgagcgggcg gccgacgtgg gggcgcacct cgtcgtgttc 
cccgagacgt acgtcccggg gtaccccgac tgggtctggc ggacgcgccc cgacgacttc 
aagctcgcgg gcgcgctgca cgagcgcctc ctcgcgaacg cggtcgacct cgaaaaggac 
cagctcgcgc cgctccgcga ggccgcgcgc cggcgaggcg tcaccatcgc gtgcggcgtg 
aacgagcgcg aggggagcca cggccgcgcg accctctaca acaccgtcgt cgtcgtcgga 
cccgacggcg cgatcctcaa ccgccaccgc aagctcgtcc ccacgaaccc cgagcgcatg 
gtctggggcc cgggtgacgc gagcgggctg cgcgtcgtcg acacgccggc cggccgcgtg 
ggggcgctca tctgctggga gaactacatg ccgctcgcgc gcttcgcgct ctacgcgcag 
ggcgtcgagg tctacctcgc gccgacgtgg gatcacggcg acacttggct cgcctccatg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
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cggcacatcg cgcgcgaggc gcgcgcctgg gtcgtctcgg gggccatctg catgcaggcg ' 660 

aaggacgtcc ccgccgactt cccgcagcgc gcggcgatct accccgacga ggaggagtgg 720 

ctcaaccccg gcgatgccgt cgtcgtcgat cccaccggcg ccgtcgccgc cggcccgctg 780 

caccgcgagc gcggcatcct ctacgccgag tgcgatcccg cgcgggcgtc gctcgcccgc 840 

cgcacgctcg acgtctccgg gcactacgga cggcccgacg tctttcacct gcagatcgac 900 

cgcacaccgc gcgtgccggc gtcgttccgg gactga 936 

<210> 282 
<211> 311 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 282 

Met Ser Lys lie Ala Val Val Gin Glu Pro Pro Val Leu Leu Asp Arg 

I 5 10 15 

Ala Ala Thr Leu Glu Arg Ala Val Ser Ala lie Glu Arg Ala Ala Asp 

20 25 30 

Val Gly Ala His Leu Val Val Phe Pro Glu Thr Tyr Val Pro Gly Tyr 

35 40 45 

Pro Asp Trp Val Trp Arg Thr Arg Pro Asp Asp Phe Lys Leu Ala Gly 

50 55 60 

Ala Leu His Glu Arg Leu Leu Ala Asn Ala Val Asp Leu Glu Lys Asp 
65 70 75 80 

Gin Leu Ala Pro Leu Arg Glu Ala Ala Arg Arg Arg Gly Val Thr He 

85 90 95 

Ala Cys Gly Val Asn Glu Arg Glu Gly Ser His Gly Arg Ala Thr Leu 

100 105 HO 

Tyr Asn Thr Val Val Val Val Gly Pro Asp Gly Ala He Leu Asn Arg 

115 120 125 

His Arg Lys Leu Val Pro Thr Asn Pro Glu Arg Met Val Trp Gly Pro 

130 135 140 

Gly Asp Ala Ser Gly Leu Arg Val Val Asp Thr Pro Ala Gly Arg Val 
145 150 155 160 

Gly Ala Leu He Cys Trp Glu Asn Tyr Met Pro Leu Ala Arg Phe Ala 

165 170 175 

Leu Tyr Ala Gin Gly Val Glu Val Tyr Leu Ala Pro Thr Trp Asp His 

180 185 190 

Gly Asp Thr Trp Leu Ala Ser Met Arg His He Ala Arg Glu Ala Arg 

195 200 205 

Ala Trp Val Val Ser Gly Ala He Cys Met Gin Ala Lys Asp Val Pro 

210 215 220 

Ala Asp Phe Pro Gin Arg Ala Ala He Tyr Pro Asp Glu Glu Glu Trp 
225 230 235 240 

Leu Asn Pro Gly Asp Ala Val Val Val Asp Pro Thr Gly Ala Val Ala 

245 250 255 

Ala Gly Pro Leu His Arg Glu Arg Gly He Leu Tyr Ala Glu Cys Asp 

260 265 270 

Pro Ala Arg Ala Ser Leu Ala Arg Arg Thr Leu Asp Val Ser Gly His 

275 280 285 

Tyr Gly Arg Pro Asp Val Phe His Leu Gin He Asp Arg Thr Pro Arg 

290 295 300 

Val Pro Ala Ser Phe Arg Asp 
305 310 

<210> 283 
<211> 1017 
<212> DNA 
<213> Unknown 
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<220> 

<223> Obtained from an environmental sample 
<400> 283 

atgggcatcg aacatccgaa gtacaaggtc gcggtggtgc aggcggcacc ggcctggctc 60 

gatctcgacg cgtcgatcga caagaccatc gggctgatcg aggaggccgc ccaaaaaggc 120 

gccaagctga ttgcattccc cgaggccttc atccccggtt acccctggca catctggatg 180 

gactcgccgg cctgggcgat cggccgcggt ttcgtgcagc gctattttga caattcgctc 240 

gcctatgaca gcccgcaggc cgagaaactg cgcgcggcgg ttcgcaaggc aaagctcacg 300 

gccgtgatcg ggctgtccga gcgcgacggc ggcagtcttt acctcgcgca atggctgatc 360 

gggcccgacg gtgagaccat cgccaagcgc cgcaagctgc ggccgacaca tgcggagcgc 420 

acggtatacg gcgagggcga cggcagcgac ctcgcggttc acaaccgtcc ggacattggc 480 

cgccttggcg cgctctgctg ctgggagcat ctccagccgc tgtcgaaata cgcgatgtac 540 

gcgcagaacg agcaggtgca tgtcgcggcc tggccgagct tttcgctgta cgatcccttc 600 

gcggtggcgc tcggcgccga agtgaacaac gcggcctcgc gcgtctatgc ggtcgaaggc 660 

tcctgcttcg tgctggcgcc gtgcgcgacg gtctcgcagg ccatgatcga cgaactctgc 720 

gaccggccgg acaagcacgc gctgttgcat gtcggcggcg gttttgccgc gatctacggt 780 

cctgacggca gccagatcgg cgacaaactc gctcccgacc aggaagggtt gctgatcgcg 840 

gagatcgatc tcggcgccat cggcgtcgcc aagaatgcgg cggatcccgc cgggcattat 900 

tcgcggcctg acgtgacgcg actgttgctc aacaagaagc cgtacaagcg cgtcgagcag 960 

ttttcgccgc cggccgaggc gctcgagccg acggatatcg cagcagcagc aagctaa 1017 

<210> 284 
<211> 338 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 284 

Met Gly lie Glu His Pro Lys Tyr Lys Val Ala Val Val Gin Ala Ala 

15 10 15 

Pro Ala Trp Leu Asp Leu Asp Ala Ser He Asp Lys Thr He Gly Leu 

20 25 30 

He Glu Glu Ala Ala Gin Lys Gly Ala Lys Leu He Ala Phe Pro Glu 

35 40 45 

Ala Phe He Pro Gly Tyr Pro Trp His He Trp Met Asp Ser Pro Ala 

50 55 60 

Trp Ala He Gly Arg Gly Phe Val Gin Arg Tyr Phe Asp Asn Ser Leu 
65 70 75 80 

Ala Tyr Asp Ser Pro Gin Ala Glu Lys Leu Arg Ala Ala Val Arg Lys 

85 90 95 

Ala Lys Leu Thr Ala Val He Gly Leu Ser Glu Arg Asp Gly Gly Ser 

100 105 HO 

Leu Tyr Leu Ala Gin Trp Leu He Gly Pro Asp Gly Glu Thr He Ala 

115 120 125 

Lys Arg Arg Lys Leu Arg Pro Thr His Ala Glu Arg Thr Val Tyr Gly 

130 135 140 

Glu Gly Asp Gly Ser Asp Leu Ala Val His Asn Arg Pro Asp He Gly 
145 150 155 160 

Arg Leu Gly Ala Leu Cys Cys Trp Glu His Leu Gin Pro Leu Ser Lys 

165 170 I 75 

Tyr Ala Met Tyr Ala Gin Asn Glu Gin Val His Val Ala Ala Trp Pro 

180 185 190 

Ser Phe Ser Leu Tyr Asp Pro Phe Ala Val Ala Leu Gly Ala Glu Val 

195 200 205 

Asn Asn Ala Ala Ser Arg Val Tyr Ala Val Glu Gly Ser Cys Phe Val 

210 215 220 

Leu Ala Pro Cys Ala Thr Val Ser Gin Ala Met He Asp Glu Leu Cys 
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225 230 235 240 

Asd Ara Pro Asp Lys His Ala Leu Leu His Val Gly Gly Gly Phe Ala 

245 250 255 

Ala He Tyr Gly Pro Asp Gly Ser Gin He Gly Asp Lys Leu Ala Pro 

260 265 270 

Asp Gin Glu Gly Leu Leu He Ala Glu He Asp Leu Gly Ala He Gly 

275 280 285 

Val Ala Lys Asn Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp 

290 295 300 

Val Thr Arg Leu Leu Leu Asn Lys Lys Pro Tyr Lys Arg Val Glu Gin 
305 310 315 320 

Phe Ser Pro Pro Ala Glu Ala Leu Glu Pro Thr Asp lie Ala Ala Ala 
325 330 335 

Ala Ser 



<210> 285 
<211> 918 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 285 

atgaatacta atctagtaaa ggtcgcggcg gctcaagttg ctccccattt tctcaatttg 60 
agcaatacgg tggaaaaaac ctgcaactta atttctgaag caggcaaaaa tggagcaaag 120 
ctaatcgtat ttccagaagc cttcatctct ggttatcccg attgggtctg gcttattccc 180 
aatgcgaatt ctgcaatgct ggatgattta taccaggaat tggttgagaa cgcagtaacg 240 
atccctgata caacaacaca taaactatgt caggctgcaa aagatgcagg ggtatatgtt 300 
gcggtaggta tacatgagag aaattcagaa gcaagtggct tcacgctttt caatacgctt 360 
ctatacatca acgatcaagg cgtaatcatc ggaaaacacc gaaaattaat ccctacaggg 420 
ggcgaacggc tggtctgggg gcagggtaat ggggatacac tttctgcatt cgatacagac 480 
ttcggcaaat taggaggatt gctttgttgg gaaaattata tgccactcgc gcgtcaagct 540 
atgtattccg ttggaactga agtatatgta gccccaacct gggactccag cgagaattgg 600 
ttgttaagca tgcgccatat tgcccgagag ggcggtatgt ttgtaattag tgtttgccag 660 
gctctccgaa aagacgacat ccctgatcag tatgaattta agaaactcta tcctgataat 720 
tcagaatgga tcaatagcgg taacagttgc atcatcaacc cgcgcggtga gattattgct 
ggaccaatct caaaccaaca agaaatactt tatgcagatt tagacctgag tttaattgca 
aaatctaaac gtatgttcga tgttactggg cattattccc ggccggatgt gtttagctat 
gaaatcaaca aaagctag 

<210> 286 

<211> 305 

<212> PRT 

<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 286 

Met Asn Thr Asn Leu Val Lys Val Ala Ala Ala Gin Val Ala Pro His 
I 5 10 15 

Phe Leu Asn Leu Ser Asn Thr Val Glu Lys Thr Cys Asn Leu lie Ser 
20 25 30 

Glu Ala Gly Lys Asn Gly Ala Lys Leu He Val Phe Pro Glu Ala Phe 
35 40 45 

He Ser Gly Tyr Pro Asp Trp Val Trp Leu lie Pro Asn Ala Asn Ser 
• 50 55 60 

Ala Met Leu Asp Asp Leu Tyr Gin Glu Leu Val Glu Asn Ala Val Thr 

65 "70 75 80 



780 
840 
900 
918 
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He Pro Asp Thr Thr Thr His Lys Leu Cys Gin Ala Ala Lys Asp Ala 

85 90 95 

Gly Val Tyr Val Ala Val Gly He His Glu Arg Asn Ser Glu Ala Ser 

100 105 110 

Gly Phe Thr Leu Phe Asn Thr Leu Leu Tyr He Asn Asp Gin Gly Val 

115 120 125 

He He Gly Lys His Arg Lys Leu He Pro Thr Gly Gly Glu Arg Leu 

130 135 140 

Val Trp Gly Gin Gly Asn Gly Asp Thr Leu Ser Ala Phe Asp Thr Asp 
145 150 155 160 

Phe Glv Lys Leu Gly Gly Leu Leu Cys Trp Glu Asn Tyr Met Pro Leu 

165 170 175 

Ala Arg Gin Ala Met Tyr Ser Val Gly Thr Glu Val Tyr Val Ala Pro 

180 185 190 

Thr Trp Asp Ser Ser Glu Asn Trp Leu Leu Ser Met Arg His He Ala 

195 200 205 

Arg Glu Gly Gly Met Phe Val He Ser Val Cys Gin Ala Leu Arg Lys 

210 215 220 

Asp Asp He Pro Asp Gin Tyr Glu Phe Lys Lys Leu Tyr Pro Asp Asn 
225 230 235 240 

Ser Glu Trp He Asn Ser Gly Asn Ser Cys He He Asn Pro Arg Gly 

245 250 255 

Glu lie lie Ala Gly Pro He Ser Asn Gin Gin Glu He Leu Tyr Ala 

260 265 270 

Asp Leu Asp Leu Ser Leu He Ala Lys Ser Lys Arg Met Phe Asp Val 

275 280 285 

Thr Gly His Tyr Ser Arg Pro Asp Val Phe Ser Tyr Glu lie Asn Lys 
290 295 300 

Ser 
305 

<210> 287 
<211> 936 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 287 

gtgatcaagg tagcaatcgc ccaggtggca ccggtggttc tggacaaggc gcgcaccatt 60 

gagaaagcgg taggaattat tcgcgctgcc gcgcaagagg gcattgagct cctggttttc 120 

ccggagacgt ttatcccgac ctatccagcc tgggtatggc gcttgcgtcc gggtactgat 180 

tacggcctga gcgaggaact gcacgcgctc ctgctggata attcggtaga tatggagagc 240 

aaggacctgg agccattgca agctgttgct gcagagacca gcatgaccgt ggtaataggt 300 

atgaacgagc gagacggccg attcagccgg ggtacaatct acaatgccct ggttgtgatc 360 

ggtccaggtg gcacgatcct gaacaggcac cgcaagctta tgcccaccaa ccccgagcgt 420 

atggtttggg gtatgggcga tgccagcggg ctgaaggtag tggaaatgtc ttacgggcgc 480 

ctgggtgggc tgatttgctg ggagaatttc atgcctctcg cgcgctatgg cttgtatgcc 540 

cagggtgtgg agatttacgt ggcgcccacc tatgaccagg gcgacggctg ggtcggcagc 600 

atgcagcata tagcccggga gggtcgttgc tgggtactct cggccgggac ccttttgcgt 660 

ggcagtgatt ttctgccgga ttttccgggc aagaccgagt tatatcccga tgaccaggag 720 

tgggtgaatc cgggtggctc ggtgatcgtg gcaccggggg gagagattgt ggccggcccc 780 

atgtatcgcg acgaaggtct gctggtctgc gagttggatg cgacgcttag tgtccgcggc 840 

aagcgctcgc tggatgtggc cggccattac tcccggccgg atttgtttga actggaaata 900 

gatggcgacc cgctggaacc catagagtgg gattga 936 

<210> 288 
<211> 311 
<212> PRT 
<213> Unknown 
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<220> 

<223> Obtained from an environmental sample 
<400> 288 

Val He Lys Val Ala He Ala Gin Val Ala Pro Val Val Leu Asp Ly3 

15 10 15 

Ala Arg Thr He Glu Lys Ala Val Gly He He Arg Ala Ala Ala Gin 

20 25 30 

Glu Gly He Glu Leu Leu Val Phe Pro Glu Thr Phe He Pro Thr Tyr 

35 40 45 

Pro Ala Trp Val Trp Arg Leu Arg Pro Gly Thr Asp Tyr Gly Leu Ser 

50 55 60 

Glu Glu Leu His Ala Leu Leu Leu Asp Asn Ser Val Asp Met Glu Ser 
65 70 75 80 

Lys Asp Leu Glu Pro Leu Gin Ala Val Ala Ala Glu Thr Ser Met Thr 

85 90 95 

Val Val He Gly Met Asn Glu Arg Asp Gly Arg Phe Ser Arg Gly Thr 

100 105 110 

He Tyr Asn Ala Leu Val Val He Gly Pro Gly Gly Thr He Leu Asn 

115 120 125 

Arg His Arg Lys Leu Met Pro Thr Asn Pro Glu Arg Met Val Trp Gly 

130 135 140 

Met Gly Asp Ala Ser Gly Leu Lys Val Val Glu Met Ser Tyr Gly Arg 
145 150 155 160 

Leu Gly Gly Leu He Cys Trp Glu Asn Phe Met Pro Leu Ala Arg Tyr 

165 170 175 

Gly Leu Tyr Ala Gin Gly Val Glu He Tyr Val Ala Pro Thr Tyr Asp 

180 185 190 

Gin Gly Asp Gly Trp Val Gly Ser Met Gin His He Ala Arg Glu Gly 

195 200 205 

Arg Cys Trp Val Leu Ser Ala Gly Thr Leu Leu Arg Gly Ser Asp Phe 

210 215 220 

Leu Pro Asp Phe Pro Gly Lys Thr Glu Leu Tyr Pro Asp Asp Gin Glu 
225 230 235 240 

Trp Val Asn Pro Gly Gly Ser Val He Val Ala Pro Gly Gly Glu He 

245 250 255 

Val Ala Gly Pro Met Tyr Arg Asp Glu Gly Leu Leu Val Cys Glu Leu 

260 265 270 

Asp Ala Thr Leu Ser Val Arg Gly Lys Arg Ser Leu Asp Val Ala Gly 

275 280 285 

His Tyr Ser Arg Pro Asp Leu Phe Glu Leu Glu He Asp Gly Asp Pro 

290 295 300 

Leu Glu Pro He Glu Trp Asp 
305 310 

<210> 289 
<211> 921 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 289 

atggtgacgg tggccgccgt acaggcaacg ccggtgttcc tcgaccgcga ggcgacctcg 60 

gacaaggtct gcgccttggt caaggaggcg gccggccacg gggcagaact gatcgtcttc 120 

cccgagtcct tcgtccctgc ctatccggac tgggtgtggc gcacccctgc ctggagtgac 180 

accgagttcg tgaagcgctt ctacgcgaac gcggtgaccg tccccggcgc gaccctcgag 240 

cgcatcggcg cagcggcggc ggaggcggag gcgtacgtcg tgatcggcgt gaccgagatc 300 

gacggcggaa ctctctacaa cacccttctc tacctgggcc cggacggaca gctgttgcaa 360 
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420 
480 



780 
840 
900 
921 



cggcatcgca agctcatgcc caccggtggg gagcggaccg tgtggggaat gggagacggc 
tctgagctcg acgtcgtgag cacgccgttc ggcgtcgtcg gtgggttgtt gtgctgggag 
aactacatgc cgctcgcccg ggcggcgatc tacgcccagc actgtgacat ctacctggct 540 
ccgacatggg acaacagcga cacgtgggta gccacgttgc gtcacatcgc caaggagggg 600 
cggcagttcg tcatcggcgt cgccccgctg ctgcgcggct ccgacgtacc ggaggacctc 660 
cgcggcacgc tctacgggct gtcggacgac tggatgtcgc gcggctacac caccatcgtc 720 
gcaccaagcg gcgaggtgat cgccggcccg gtcctggagc gtgaggagat cctcttcgcg *7 ftn 
gacctcgacc tggccgacgt gcaggagcag agaaggatgt tcgaccctgt cggccactac 
tcacgacccg acgtcttcac gctccacgtc gacgcacgac cgaagagccc ggtcgtcttc 
gagagggatg caccgacctg a 

<210> 290 

<211> 306 

<212> PRT 

<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 290 

Met Val Thr Val Ala Ala Val Gin Ala Thr Pro Val Phe Leu Asp Arg 

15 10 15 

Glu Ala Thr Ser Asp Lys Val Cys Ala Leu Val Lys Glu Ala Ala Gly 

20 25 30 

His Gly Ala Glu Leu lie Val Phe Pro Glu Ser Phe Val Pro Ala Tyr 

35 40 45 

Pro Asp Trp Val Trp Arg Thr Pro Ala Trp Ser Asp Thr Glu Phe Val 

50 55 60 

Lvs Arq Phe Tyr Ala Asn Ala Val Thr Val Pro Gly Ala Thr Leu Glu 
65 70 75 80 

Arq lie Gly Ala Ala Ala Ala Glu Ala Glu Ala Tyr Val Val He Gly 

85 90 95 

Val Thr Glu He Asp Gly Gly Thr Leu Tyr Asn Thr Leu Leu Tyr Leu 

100 105 HO 

Gly Pro Asp Gly Gin Leu Leu Gin Arg His Arg Lys Leu Met Pro Thr 

115 120 125 

Gly Gly Glu Arg Thr Val Trp Gly Met Gly Asp Gly Ser Glu Leu Asp 

130 135 140 

Val Val Ser Thr Pro Phe Gly Val Val Gly Gly Leu Leu Cys Trp Glu 
145 150 155 160 

Asn Tyr Met Pro Leu Ala Arg Ala Ala He Tyr Ala Gin His Cys Asp 

165 170 175 

He Tyr Leu Ala Pro Thr Trp Asp Asn Ser Asp Thr Trp Val Ala Thr 

180 185 190 

Leu Arg His He Ala Lys Glu Gly Arg Gin Phe Val He Gly Val Ala 

195 200 205 

Pro Leu Leu Arg Gly Ser Asp Val Pro Glu Asp Leu Arg Gly Thr Leu 

210 215 220 

Tvr Gly Leu Ser Asp Asp Trp Met Ser Arg Gly Tyr Thr Thr He Val 
225 230 235 240 

Ala Pro Ser Gly Glu Val He Ala Gly Pro Val Leu Glu Arg Glu Glu 

245 250 255 

He Leu Phe Ala Asp Leu Asp Leu Ala Asp Val Gin Glu Gin Arg Arg 

260 265 270 

Met Phe Asp Pro Val Gly His Tyr Ser Arg Pro Asp Val Phe Thr Leu 

275 280 285 

His Val Asp Ala Arg Pro Lys Ser Pro Val Val Phe Glu Arg Asp Ala 

290 295 300 

Pro Thr 
305 ' 
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480 
540 
600 



<210> 291 
<211> 1002 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 291 

atgatgaaaa caactgttac cgttgcctgc gttcaggccg cccccgtatt tatggattta 60 

gaaggcacca tagataaaac gatcaccctc atctctgaag ccgcacagaa aggcgcggag 120 

ctcatcgctt ttccggagac ctggataccc ggttacccgt ggttcttatg gctgaactcg 180 

cccgcgacaa atatgcccct ggtttatcaa tatcatcaga actctctggt gctggacagt 240 

gcccaggcga agcgaattgc ggatgctgca cagcagaata acatcactgt cgttctggga 300 

ttcagcgagc gcgatcatgg aagcctctat atctcacagt ggctgattgg cagcgacggg 360 

gaaactattg gcatccggcg caagctcaag gccacacacg tggagcgtac gctgttcggc 420 
gaaagcgacg gctcctccct gaccacctgg gagacacctc tgggtaacgt cggggccctc 
tgctgctggg agcacctgca gccgctgtct cgctatgcga tgtattccca gcatgaagag 
atccatatcg ctgcgtggcc cagcttcagt ctttacacca gcgcaactgc cgcgctcgga 

cctgacgtca acacggcggc ttcacgcctc tatgccgcgg aggggcagtg cttcgtgtta 660 

gccccatgtg ccgtggtttc tgatgagatg attgatttac tctgtcctga tgatgaccgg 720 

agagcgttac tcagtgccgg agggggacat gcccgtattt acggacctga tggaagagaa 780 

ctcgtcaccc ctctcgggga aaatgaggaa ggactgctta tcgctgagct cgactctgct 840 

gcgatcacct ttgccaaact ggcggcagat cccgtaggcc actattcccg ccctgacgtg 900 

acccgccttc tttttaatcc ttcagccaac aagactgtta ttaaacggca ttcgcctcct 960 

gagttaattg ccgaacaggc tccggaagaa gaggaggagt ag 1002 

<210> 292 

<211> 333 

<212> PRT 

<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 292 

Met Met Lys Thr Thr Val Thr Val Ala Cys Val Gin Ala Ala Pro Val 

15 10 15 

Phe Met Asp Leu Glu Gly Thr lie Asp Lys Thr He Thr Leu He Ser 

20 25 30 

Glu Ala Ala Gin Lys Gly Ala Glu Leu He Ala Phe Pro Glu Thr Trp 

35 40 45 

He Pro Gly Tyr Pro Trp Phe Leu Trp Leu Asn Ser Pro Ala Thr Asn 

50 55 60 

Met Pro Leu Val Tyr Gin Tyr His Gin Asn Ser Leu Val Leu Asp Ser 
65 70 75 80 

Ala Gin Ala Lys Arg He Ala Asp Ala Ala Gin Gin Asn Asn He Thr 

85 90 95 

Val Val Leu Gly Phe Ser Glu Arg Asp His Gly Ser Leu Tyr He Ser 

100 105 110 

Gin Trp Leu He Gly Ser Asp Gly Glu Thr He Gly He Arg Arg Lys 

115 120 125 

Leu Lys Ala Thr His Val Glu Arg Thr Leu Phe Gly Glu Ser Asp Gly 

130 135 140 

Ser Ser Leu Thr Thr Trp Glu Thr Pro Leu Gly Asn Val Gly Ala Leu 
145 150 155 160 

Cys Cys Trp Glu His Leu Gin Pro Leu Ser Arg Tyr Ala Met Tyr Ser 

165 170 175 

Gin His Glu Glu lie His He Ala Ala Trp Pro Ser Phe Ser Leu Tyr 

180 185 190 

Thr Ser Ala Thr Ala Ala Leu Gly Pro Asp Val Asn Thr Ala Ala Ser 
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195 






200 








205 






Arg 


Leu 


Tyr 


Ala 


Ala 


Glu Gly Gin 


Cys 


Phe 


Val Leu 


Ala 


Pro Cys 


Ala 


210 








215 






220 








Val 


Val 


Ser 


Asp 


Glu 


Met lie Asp 


Leu 


Leu 


Cys Pro 


Asp 


Asp Asp 


Arg 


225 








230 






235 






240 


Arg 


Ala 


Leu 


Leu 


Ser 


Ala Gly Gly 


Gly 


His 


Ala Arg 


He 


Tyr Gly 


Pro 








245 






250 






255 




Asp Gly 


Arg 


Glu 


Leu 


Val Thr Pro 


Leu 


Gly 


Glu Asn 


Glu 


Glu Gly 


Leu 






260 






265 








270 




Leu 


lie 


Ala 
275 


Glu 


Leu 


Asp Ser Ala 
2B0 


Ala 


He 


Thr Phe 


Ala 
285 


Lys Leu 


Ala 


Ala 


Asp 


Pro 


Val Gly 


His Tyr Ser 


Arg 


Pro 


Asp Val 


Thr 


Arg Leu 


Leu 




290 








295 






300 








Phe 


Asn 


Pro 


Ser 


Ala 


Asn Lys Thr 


Val 


He 


Lys Arg 


His 


Ser Pro 


Pro 


305 










310 






315 






320 


Glu 


Leu 


lie 


Ala 


Glu 
325 


Gin Ala Pro 


Glu 


Glu 
330 


Glu Glu 


Glu 







<210> 293 
<2U> 1008 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 293 

atgaaaaata 

agtccggtgt 

ctaggaaaac 

tatttctctt 

caatcagtta 

aatatggtag 

ctcctttttg 

catgaaagaa 

gttgggcgta 

ttgattgctg 

ttgcacggtg 

gtcgtagtgg 

ggtggaccaa 

caactaatcg 

gcacaaattg 

ttgttgagct 

actgagccgg 



tcaaaaactc 
tgtacaaccg 
aaggggtaca 
ttattcaggc 
ctgttccgtc 
tgtctattgg 
atgctgacgg 
tgatttgggg 
tcgggcagtt 
atggagaaca 
aacaaaccga 
ctaccggttg 
tcggaccaat 
gggaagccct 
atgcccgcaa 
tgcatatcga 
gattagcaaa 



agaaaaaagc 

cgaagctacc 

attcgccact 

gccttatgcc 

agccgcgacc 

tgtcaatgaa* 

aacattaatt 

acaaggtgac 

ggcttgttgg 

aatccattct 

aatcaatgta 

gttggatgcc 

ttcgggaggt 

tacatcaggt 

aagattaatg 

tcacactccg 

aagacaaaat 



agcacagtaa 
gttcaaaaag 
tttccggaaa 
atgggcaaag 
gatgccataa 
cgagacggtg 
cagcgcagac 
gcttcaggtc 
gaacattaca 
gccatgtatc 
cgccaacacg 
gatcaacaag 
tgttttacag 
gaaggggaag 
gatgccagtg 
actgctccta 
gaaaattcat 



gagtcgctgc 
tagtcaacaa 
cgatagtgcc 
aacacctgcg 
gtgaggcggc 
gtaccattta 
gtaaacttac 
ttcgtgccac 
atccattgtt 
ccggatcatt 
ctttagaatc 
cacaaattgc 
ccgttatagg 
tgattgccga 
gtcactacaa 
tgcatgaaag 
caaattaa 



ggtacaaatc 
aatccttgaa 
ttattatcct 
cttgcttgaa 
aaaggaagcc 
caatacgcaa 
accaacgtat 
agacagcgct 
ccgttatgct 
tttaggtgcg 
ggccagcttc 
gaaagacacc 
ccctgacgga 
tattgatttg 
ccgtcctgaa 
agtagtttac 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900* 
960 
1008 



<210> 294 
<211> 335 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 294 

Met Lys Asn He Lys Asn Ser Glu Lys Ser Ser Thr Val Arg Val Ala 

15 10 15 

Ala Val Gin He Ser Pro Val Leu Tyr Asn Arg Glu Ala Thr Val Gin 

20 25 30 

Lys Val Val Asn Lys He Leu Glu Leu Gly Lys Gin Gly Val Gin Phe 

35 40 45 

Ala Thr Phe Pro Glu Thr He Val Pro Tyr Tyr Pro Tyr Phe Ser Phe 
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50 55 60 

He Gin Ala Pro Tyr Ala Met Gly Lys Glu His Leu Arg Leu Leu Glu 
65 70 75 80 

Gin Ser Val Thr Val Pro Ser Ala Ala Thr Asp Ala He Ser Glu Ala 

85 90 95 

Ala Lys Glu Ala Asn Met Val Val Ser He Gly Val Asn Glu Arg Asp 

100 105 HO 

Gly Gly Thr He Tyr Asn Thr Gin Leu Leu Phe Asp Ala Asp Gly Thr 

115 120 125 

Leu He Gin Arg Arg Arg Lys Leu Thr Pro Thr Tyr His Glu Arg Met 

130 135 140 

He Trp Gly Gin Gly Asp Ala Ser Gly Leu Arg Ala Thr Asp Ser Ala 
145 150 155 160 

Val Gly Arg He Gly Gin Leu Ala Cys Trp Glu His Tyr Asn Pro Leu 

165 170 175 

Phe Arg Tyr Ala Leu He Ala Asp Gly Glu Gin He His Ser Ala Met 

180 185 190 

Tyr Pro Gly Ser Phe Leu Gly Ala Leu His Gly Glu Gin Thr Glu He 

195 200 205 

Asn Val Arg Gin His Ala Leu Glu Ser Ala Ser Phe Val Val Val Ala 

210 215 220 

Thr Gly Trp Leu Asp Ala Asp Gin Gin Ala Gin He Ala Lys Asp Thr 
225 230 235 240 

Gly Gly Pro He Gly Pro He Ser Gly Gly Cys Phe Thr Ala Val He 

245 250 255 

Gly Pro Asp Gly Gin Leu He Gly Glu Ala Leu Thr Ser Gly Glu Gly 

260 265 270 

Glu Val He Ala Asp He Asp Leu Ala Gin He Asp Ala Arg Lys Arg 

275 280 285 

Leu Met Asp Ala Ser Gly His Tyr Asn Arg Pro Glu Leu Leu Ser Leu 

290 295 300 

His He Asp His Thr Pro Thr Ala Pro Met Hi3 Glu Arg Val Val Tyr 
305 310 315 320 

Thr Glu Pro Gly Leu Ala Lys Arg Gin Asn Glu Asn Ser Ser Asn 
325 330 335 

<210> 295 
<211> 1134 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 295 

atggccgcaa aagtacttgg aggacgtgac acagtaaaag tagcagttgt tcaaacacca 

tcagtcttta tggacaagaa agcctgtctc gaacttgcct gcgataagat tatcgaagcc 120 

ggcaaagagg gcgcggagct tgttgttttt cctgaaacct ggattccgac atatccttat 180 

tggaccatgg gatgggatac catcgcccat ggcttccatg atgtgatggc ggacctgcag 240 

gacaattccg tggtcgtcgg gagcgaagac accgacatat tgggcaaagc cgcccgggaa 300 

gctggcgcct acgtcgtcat gggctgtaat gagctcgatg accgggtcgg cagcaggacc 360 

ttgttcaact cgctggtcta tatggacaaa tatggcggcg tgctcggccg tcaccgtaaa 420 

ttaatcccgt cctttatcga acgcatctgg tggggcaatg gggacagccg cgatctcaaa 480 

gtttacgaca cggaaattgg gcgcatcggc ggtcaaatct gctgggaaaa tcacattgtg 540 

aacatcaccg catggtacat tgcccaaggc gtggatattc atgtctcggt ttggccggga 600 

atgtggaact gtggcgcaga agaaggagaa tccttcatat acgccggtca cgacatcaac 660 

aaatgtgacc tcatccctgc tacacgcgga cgggcattta cgggtcagtg ttatgtcctc 720 

tcagccaaca acctactgcg gatggaagac attcctgacg atttcccgtt ccgggatacc 780 

atgaactatg gcggtccggg ccaggaggat tttgtcggat gggcttgcgg tggcagccat 840 

attgttgcgc caacgtctga atttatggtg ccgccgacgt ttgatataga caccatcatc 900 

tatgcagaac ttcaggcgaa atacatcaaa gtggtgaagt cggtcttcga ttccctcggc 960 



60 
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cactacgcgc ggtgggacct cgtcaacttg accacgccgc caccgccgta tgaacctgaa 1020 
accgacgcac cagctctcac ggccgatatc cgtgatcggg tcatcgagag tgtggctaaa 1080 
gagttcaagc tcgaaccaga aaaagtggct gaagttgtgc gcaatgccgc ctag 1134 

<210> 296 
<211> 377 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 296 

Met Ala Ala Lys Val Leu Gly Gly Arg Asp Thr Val Lys Val Ala Val 

15 10 15 

Val Gin Thr Pro Ser Val Phe Met Asp Lys Lys Ala Cys Leu Glu Leu 

20 25 30 

Ala Cys Asp Lys He He Glu Ala Gly Lys Glu Gly Ala Glu Leu Val 

35 40 45 

Val Phe Pro Glu Thr Trp He Pro Thr Tyr Pro Tyr Trp Thr Met Gly 

50 55 60 

Trp Asp Thr He Ala His Gly Phe His Asp Val Met Ala Asp Leu Gin 
65 70 75 80 

Asp Asn Ser Val Val Val Gly Ser Glu Asp Thr Asp He Leu Gly Lys 

85 90 95 

Ala Ala Arg Glu Ala Gly Ala Tyr Val Val Met Gly Cys Asn Glu Leu 

100 105 HO 

Asp Asp Arg Val Gly Ser Arg Thr Leu Phe Asn Ser Leu Val Tyr Met 

115 120 125 

Asp Lys Tyr Gly Gly Val Leu Gly Arg His Arg Lys Leu He Pro Ser 

130 135 140 

Phe He Glu Arg He Trp Trp Gly Asn Gly Asp Ser Arg Asp Leu Lys 
145 150 155 160 

Val Tyr Asp Thr Glu He Gly Arg He Gly Gly Gin He Cys Trp Glu 

165 170 175 

Asn His He Val Asn He Thr Ala Trp Tyr lie Ala Gin Gly Val Asp 

180 185 190 

He His Val Ser Val Trp Pro Gly Met Trp Asn Cys Gly Ala Glu Glu 

195 200 205 

Gly Glu Ser Phe He Tyr Ala Gly His Asp He Asn Lys Cys Asp Leu 

210 215 220 

He Pro Ala Thr Arg Gly Arg Ala Phe Thr Gly Gin Cys Tyr Val Leu 
225 230 235 240 

Ser Ala Asn Asn Leu Leu Arg Met Glu Asp He Pro Asp Asp Phe Pro 

245 250 255 

Phe Arg Asp Thr Met Asn Tyr Gly Gly Pro Gly Gin Glu Asp Phe Val 

260 265 270 

Gly Trp Ala Cys Gly Gly Ser His He Val Ala Pro Thr Ser Glu Phe 

275 280 285 

Met Val Pro Pro Thr Phe Asp He Asp Thr He He Tyr Ala Glu Leu 

290 295 30° 

Gin Ala Lys Tyr He Lys Val Val Lys Ser Val Phe Asp Ser Leu Gly 
305 310 315 320 

His Tyr Ala Arg Trp Asp Leu Val Asn Leu Thr Thr Pro Pro Pro Pro 

325 330 335 

Tyr Glu Pro Glu Thr Asp Ala Pro Ala Leu Thr Ala Asp He Arg Asp 

340 345 350 

Arg Val He Glu Ser Val Ala Lys Glu Phe Lys Leu Glu Pro Glu Lys 

355 360 365 

Val Ala Glu Val Val Arg Asn Ala Ala 
370 375 
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<210> 297 
<211> 1059 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 297 

atgacacgtt ttcgggacgt cacggtggcg gcggttcagg ccgcacccgt ctatttcgat 60 

cgggaggcct ccacagataa ggcgtgccaa ttgattcacg aagcggcgaa gaaaggcgca 120 

gccctcgcgg cgttcggcga aacgtggttg ccgggatatc cgttctttgc atgggggttc 180 

gcgcacaacc ggagcctgtt ctggaatgcg gccgccgagt acatcgccaa tgctgtggag 240 

attccgagtc caacgacgga ccgcctctgc gccgcagcga agatcgccgg gattgacgtg 300 

gtaatcggcg tcgtagaact ggatggacga acgcgagcgt cggtttacag cacactgctg 360 

ttcatcggga gagagggggc gatcctgggg cgccaccgca aattgaagcc aacccacatg 420 

gagcgaacgg tgtggggtga aggggacgct cacgggctcc gcgttcacga gcgtccgtac 480 

ggccgcctca gcgggctgaa ttgctgggaa cacaacatga tgctgcccgg ctatgtgctt 540 

gccgcgcagg gcacgcagtt tcacgtcgcg acatggcctg ggaaagagag gctcacagtt 600 

ccgccgaacg aggcggctta tacgcgccag cttctcctct ctcgcgccta tgcatcccag 660 

gcgggcgcgt acgtgatcag cgtcgcgggg ctgctcggac ccgactcgat gccggagcgt 720 

tatcgcgaac tgggacagtc ctatgagttg accggcgaca gcgtcatcat cgatccgcgc 780 

ggcgaggtca tcgcggggcc cgcgaagggc gagaccatcc tgctcgcgca atgcagccag 840 

gaagccctct tcgccgcaaa gtccgccatt gacgtcggcg gtcattactc gcgcccggat 900 

atttttcagc tgcgtgtcaa cgatcagcta cagcatcagg tccggagact cgaggcgact 960 

ctcacgcccc cagtcgccgt agttgtcggg cctgagggta gctcacatga gcaggagacg 1020 

gccttcgggc catccagcct cctggccacg acaagctag 1059 

<210> 298 
<211> 352 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 298 



Met 


Thr 


Arg 


Phe 


Arg 


Asp 


Val 


Thr Val Ala 


Ala Val Gin Ala Ala Pro 


1 






5 






10 


15 


Val 


Tyr 


Phe 


Asp 


Arg 


Glu 


Ala 


Ser Thr Asp 


Lys Ala Cys Gin Leu He 






20 








25 


30 


His 


Glu 


Ala 


Ala 


Lys 


Lys 


Gly 


Ala Ala Leu 


Ala Ala Phe Gly Glu Thr 






35 










40 


45 


Trp 


Leu 


Pro 


Gly 


Tyr 


Pro 


Phe 


Phe Ala Trp 


Gly Phe Ala His Asn Arg 


50 










55 




60 


Ser 


Leu 


Phe 


Trp 


Asn 


Ala 


Ala 


Ala Glu Tyr 


He Ala Asn Ala Val Glu 


65 








70 






75 80 


lie 


Pro 


Ser 


Pro 


Thr 


Thr 


Asp 


Arg Leu Cys 


Ala Ala Ala Lys He Ala 










85 






90 


95 


Gly 


He 


Asp 


val 


Val 


He 


Gly 


Val Val Glu 


Leu Asp Gly Arg Thr Arg 




100 








105 


110 


Ala 


Ser 


Val 


Tyr 


Ser 


Thr 


Leu 


Leu Phe He 


Gly Arg Glu Gly Ala He 






115 








120 


125 


Leu 


Gly 


Arg 


His 


Arg 


Lys 


Leu 


Lys Pro Thr 


His Met Glu Arg Thr Val 




130 










135 




140 


Trp 


Gly 


Glu 


Gly 


Asp 


Ala 


His 


Gly Leu Arg 


Val His Glu Arg Pro Tyr 


145 






150 






155 160 


Gly 


Arg 


Leu 


Ser 


Gly 


Leu 


Asn 


Cys Trp Glu 


His Asn Met Met Leu Pro 






165 






170 


175 


Gly 


Tyr 


Val 


Leu 


Ala 


Ala 


Gin 


Gly Thr Gin 


Phe His Val Ala Thr Trp 
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180 




185 




190 




Pro Gly 


Lys Glu Arg Leu Thr 


Val 


Pro 


Pro Asn Glu 


Ala Ala Tyr 


Thr 






195 


200 






205 




Arg 


Gin 


Leu Leu Leu Ser Arg Ala 


Tyr 


Ala Ser Gin 


Ala Gly Ala 


Tyr 


210 


215 






220 






Val 


He 


Ser Val Ala Gly Leu 


Leu 


Gly 


Pro Asp Ser 


Met Pro Glu 


Arg 


225 




230 






235 




240 


Tyr 


Arg 


Glu Leu Gly Gin Ser 


Tyr 


Glu 


Leu Thr Gly 


Asp Ser Val 


He 


245 






250 


255 




lie Asp 


Pro Arg Gly Glu Val 


He 


Ala 


Gly Pro Ala 


Lys Gly Glu 


Thr 






260 




265 




270 




He 


Leu 


Leu Ala Gin Cys Ser 


Gin 


Glu 


Ala Leu Phe 


Ala Ala Lys 


Ser 






275 


280 






285 




Ala 


He 


Asp Val Gly Gly His Tyr 


Ser 


Arg Pro Asp 


He Phe Gin 


Leu 




290 


295 






300 






Arg 


Val 


Asn Asp Gin Leu Gin 


His 


Gin 


Val Arg Arg 


Leu Glu Ala 


Thr 


305 




310 






315 




320 


Leu 


Thr 


Pro Pro Val Ala Val 


Val 


Val 


Gly Pro Glu 


Gly Ser Ser 


His 




325 






330 


335 




Glu 


Gin 


Glu Thr Ala Phe Gly 


Pro 


Ser 


Ser Leu Leu 


Ala Thr Thr 


Ser 






340 




345 




350' 





<210> 299 
<211> 987 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 299 

atgactgttg ttaaggccgc cgcagtgcag atcagcccgg tgctctacag ccgtgcagga 60 

accgtcgaga aggtcgtgaa gaagatcgac gagctgggcc agaagggtgt cgagtttgcc 120 

gtcttccctg aaaccgttgt cccctactac ccctacttct ccttcgtgca gcccccctac 180 

aaactggcca cggagcacct gcgcctgctt gaggagtcgg tgaccgtgcc ctctgccgag 240 

acggacgcca tcggcgacgc cgcccgcaag gccaacatgg tcgtctcgat cggtgtcaac 300 

gaacgtgatg gcggcaccat ttacaacacc caactcctgt tcgacgccga cggaaccctg 360 

atccagcgcc gccgcaagat cacgccgacc taccacgagc ggatgatctg gggacaggga 420 

gacggatcag gcttgcgcgc ggtcgacagc gtcgtcggcc gcatcggcca gctcgcctgc 480 

tgggagcact accagccgct ggcccgttac gctctcatcg ccgacggcga gcagatccac 540 

gccgcgatgt accccggcgc cttcgggggc gatctgttcg cagagcagat cgaagtcaat 600 

gtccgtcagc acgctctgga atcggccagt ttcgtcgtca gcgccaccgc ctggctcgac 660 

gccgaccagc aggcccagat tgcgaaggac accggcggcc ccgtacaggc gatctccggc 720 

ggcttcttca cagccatcat cgaccccgac ggccgcatca tcggcgaacc gatcacctcc 780 

ggcgaaggcg aagtcatcgc tgacctcgac tttgcgctca tcgaccgccg caagcgcctg 840 

atggacgcca gcggccacta cagccgcccc gaactgctca gcctgcagat cgaccggacg 900 

ccggcacccg ccgtccacga tcgcaatcgc caggggtcct caagcgctcc ggcaactgaa 960 

aagggccgct cagccgaggc caagtga 987 



<210> 300 
<211> 328 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 300 

Met Thr Val Val Lys Ala Ala Ala Val Gin He Ser Pro Val Leu Tyr 

I 5 10 15 

Ser Arg Ala Gly Thr Val Glu Lys Val Val Lys Lys He Asp Glu Leu 
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20 25 30 

Gly Gin Lys Gly Val Glu Phe Ala Val Phe Pro Glu Thr Val Val Pro 

35 40 45 

Tyr Tyr Pro Tyr Phe Ser Phe Val Gin Pro Pro Tyr Lys Leu Ala Thr 

50 55 60 

Glu His Leu Arg Leu Leu Glu Glu Ser Val Thr Val Pro Ser Ala Glu 
65 70 75 80 

Thr Asp Ala He Gly Asp Ala Ala Arg Lys Ala Asn Met Val Val Ser 

85 90 95 

He Gly Val Asn Glu Arg Asp Gly Gly Thr He Tyr Asn Thr Gin Leu 

100 105 110 

Leu Phe Asp Ala Asp Gly Thr Leu He Gin Arg Arg Arg Lys He Thr 

115 120 125 

Pro Thr Tyr His Glu Arg Met He Trp Gly Gin Gly Asp Gly Ser Gly 

130 135 140 

Leu Arg Ala Val Asp Ser Val Val Gly Arg He Gly Gin Leu Ala Cys 
145 150 155 160 

Trp Glu His Tyr Gin Pro Leu Ala Arg Tyr Ala Leu He Ala Asp Gly 

165 170 175 

Glu Gin He His Ala Ala Met Tyr Pro Gly Ala Phe Gly Gly Asp Leu 

180 185 190 

Phe Ala Glu Gin He Glu Val Asn Val Arg Gin His Ala Leu Glu Ser 

195 200 205 

Ala Ser Phe Val Val Ser Ala Thr Ala Trp Leu Asp Ala Asp Gin Gin 

210 215 220 

Ala Gin He Ala Lys Asp Thr Gly Gly Pro Val Gin Ala He Ser Gly 
225 230 235 240 

Gly Phe Phe Thr Ala He He Asp Pro Asp Gly Arg He He Gly Glu 

245 250 255 

Pro He Thr Ser Gly Glu Gly Glu Val He Ala Asp Leu Asp Phe Ala 

260 265 270 

Leu He Asp Arg Arg Lys Arg Leu Met Asp Ala Ser Gly His Tyr Ser 

275 280 285 

Arg Pro Glu Leu Leu Ser Leu Gin He Asp Arg Thr Pro Ala Pro Ala 

290 295 300 

Val His Asp Arg Asn Arg Gin Gly Ser Ser Ser Ala Pro Ala Thr Glu 
305 310 315 320 

Lys Gly Arg Ser Ala Glu Ala Lys 
325 

<210> 301 
<211> 1032 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 301 

atgaccacgc ccgcgccgtt caccgccgcc gtcgtccagg cgtcgcccgt gttcctcgac 60 

cgcgatgcca cggtcgataa ggcgtgcgcc ctgatcaccg cggccggagc gcgcggcgcg 120 

aagctcgtcg tgctgcccga gaccttcgtg cccgcatacc ccgcgtgggt ctggtacctg 180 

ccgctcacgc gtcggcccga cgtcgcgaag ctctaccgcg cgctcgtcga gaacgcgatc 240 

gacgtgccgg gtccggaaac cgagcgcctc gggcgcgcgg cgcgtgacgc gggcgcgtgg 300 

gtggcgatcg gtgtcaacga gcgcaacgcg aaggcgagcc gcacgtcgct ctacaacacc 360 

gtgctcctgt tcgacgacca gggcacgctg gtcgagtcgc ggcgcaagct gatgccgacc 420 

ggcggcgagc gcctggtgtg gacgccgggc gagccggtgc cgttgcgggt tcacgacacg 4 80 

ccgctcggcc gcgtcggcgc gctgatctgc tgggagaact acatgccgct cgcgcgcttc 540 

gcgctctacg agcagggcgt ccagatctat ctcgcgccga cctgggacta cagcgaggcg 600 

tggctcgcgt ccatgcggca cgtcgcgcgt gaaggtcgca cgtgggtgat cgggtgcagc 660 

caggcggtgc ggcgtgacga gatcccggac cacctgccgt tcaaggacgc gatccccgag 720 
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agcctcgagt ggatcaacgc cgggaacagc atcgtcgtcg atccggacgg cgcggtggtc 780 

gcgggaccgc tggcgcgggc gcacgacacg ctctacgtcg agatcgatcc gggccgcgcc 840 

gccggctccc gatggatctt cgacgccgcc ggtcactatc accgtcccga cctgttccat 900 

ttcgcgatgc gcgcgccggt ggcgccagag accgcgccgg ttgtgcggaa gcgcggccgg 960 

cggctagcct cggccccggc gcggaagcca cggaaacggc cggcgcgtcc cacccgaagg 1020 

aggactcgat ga 1032 

<210> 302 
<211> 335 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 302 

Met Thr Thr Pro Ala Pro Phe Thr Ala Ala Val Val Gin Ala Ser Pro 

15 10 15 

Val Phe Leu Asp Arg Asp Ala Thr Val Asp Lys Ala Cys Ala Leu lie 

20 25 30 

Thr Ala Ala Gly Ala Arg Gly Ala Lys Leu Val Val Leu Pro Glu Thr 

35 40 45 

Phe Val Pro Ala Tyr Pro Ala Trp Val Trp Tyr Leu Pro Leu Thr Arg 

50 55 60 

Arg Pro Asp Val Ala Lys Leu Tyr Arg Ala Leu Val Glu Asn Ala lie 
65 70 75 80 

Asp Val Pro Gly Pro Glu Thr Glu Arg Leu Gly Arg Ala Ala Arg Asp 

85 90 95 

Ala Gly Ala Trp Val Ala lie Gly Val Asn Glu Arg Asn Ala Lys Ala 

100 105 110 

Ser Arg Thr Ser Leu Tyr Asn Thr Val Leu Leu Phe Asp Asp Gin Gly 

115 120 125 

Thr Leu Val Glu Ser Arg Arg Lys Leu Met Pro Thr Gly Gly Glu Arg 

130 135 140 

Leu Val Trp Thr Pro Gly Glu Pro Val Pro Leu Arg Val His Asp Thr 
145 150 155 160 

Pro Leu Gly Arg Val Gly Ala Leu lie Cys Trp Glu Asn Tyr Met Pro 

165 170 175 

Leu Ala Arg Phe Ala Leu Tyr Glu Gin Gly Val Gin lie Tyr Leu Ala 

180 185 190 

Pro Thr Trp Asp Tyr Ser Glu Ala Trp Leu Ala Ser Met Arg His Val 

195 200 205 

Ala Arg Glu Gly Arg Thr Trp Val lie Gly Cys Ser Gin Ala Val Arg 

210 215 220 

Arg Asp Glu lie Pro Asp His Leu Pro Phe Lys Asp Ala lie Pro Glu 
225 230 235 240 

Ser Leu Glu Trp lie Asn Ala Gly Asn Ser lie Val Val Asp Pro Asp 

245 250 255 

Gly Ala Val Val Ala Gly Pro Leu Ala Arg Ala His Asp Thr Leu Tyr 

260 265 270 

Val Glu lie Asp Pro Gly Arg Ala Ala Gly Ser Arg Trp lie Phe Asp 

275 280 285 

Ala Ala Gly His Tyr His Arg Pro Asp Leu Phe His Phe Ala Met Arg 

290 295 300 

Ala Pro Val Ala Pro Glu Thr Ala Pro Val Val Arg Lys Arg Gly Arg 
305 310 315 320 

Arg Leu Ala Ser Ala Pro Ala Arg Lys Pro Arg Lys Arg Pro Ala 
325 330 335 

<210> 303 
<211> 1011 



Page 198 



WO 03/097810 



PCTAJS03/15712 



<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 303 

atgggcatcg ttcaccccaa gtacaaagtc gctgtcgttc aggctgcccc tgtatggcta 60 

gacctggagg ccactgttga taaatgcatt cagttgattg aagaagcagc cagcaagggc 120 

tgcaagctca tcgccttccc tgagaccttc attcccggat acccttggta tatctggatg 180 

ggaacgcctg cctggactat tcagcgcggc tttgtacagc gctattttga taattctctg 240 

tcttatgaca gtccgcaagc ggagaagcta aggcaggcag tcaaaaaggc tgagatcacg 300 

gccgtactag gtctttctga gcgcagcggc ggtagcttgt acattgcaca atggaccatt 360 

ggccctgacg gagaaaccat acacaaacgc agaaaagtgc gtccaacgca tggtgagcgt 420 

acggtatttg gcgacggtga cggtagtgat cttgcggtgc acgatacccc cctggggcgg 480 

ctgggcgcgc ttgcgtgctg ggagaacata ctgtcactga acaagtatgc gatgtattca 540 

cagaatgagc aggtgcacgt agccgcttgg cctagcttct cggtctacga gcctttcgcc 600 

catgcattgg gttgggaggt caataacgca gtcagcaagg tttacgcggt agaaggcggc 660 

tgttttgtat tggcgccttg cgcagtggtc tccgaggaaa tgatcgaagc actgtgcgat 720 

acacccgata agcaccaact ggctcatgcg ggtggagggc atgctgtcat ttacggacca 780 

gatggcagtc ctctggcaga taagttaccc gaaggagagg aggggctatt aattgcagaa 840 

attgatctcg gtctcatcag cttggcgaaa aatgccatgg acccggtggg gcattactct 900 

cgacctgacg tacatcgctt gctattgaat cgcaatccag caaagcgggt tgaggaattt 960 

tctctgccca ttgatttggc agagacaact ccgccaatat taggcacgta g 1011 

<210> 304 
<211> 336 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 304 














Met Gly 


He 


Val 


His 


Pro 


Lys Tyr 


Lys Val Ala Val Val Gin Ala 


Ala 


1 








5 






10 15 




Pro 


Val 


Trp 


Leu Asp 


Leu 


Glu Ala 


Thr Val Asp Lys Cys He Gin 


Leu 






20 








25 30 




lie 


Glu 


Glu 


Ala 


Ala 


Ser 


Lys Gly 


Cys Lys Leu He Ala Phe Pro 


Glu 






35 








40 


45 




Thr 


Phe 


He 


Pro Gly 


Tyr 


Pro Trp 


Tyr He Trp Met Gly Thr Pro 


Ala 




50 










55 


60 




Trp 


Thr 


He 


Gin 


Arg 


Gly 


Phe Val 


Gin Arg Tyr Phe Asp Asn Ser 


Leu 


65 










70 




75 


80 


Ser 


Tyr 


Asp 


Ser 


Pro 


Gin 


Ala Glu 


Lys Leu Arg Gin Ala Val Lys Lys 






85 






90 95 




Ala 


Glu 


He 


Thr 


Ala 


Val 


Leu Gly 


Leu Ser Glu Arg Ser Gly Gly 


Ser 








100 








105 HO 




Leu 


Tyr 


He 


Ala 


Gin 


Trp 


Thr He 


Gly Pro Asp Gly Glu Thr He 


His 




115 








120 


125 




Lys 


Arg 


Arg 


Lys 


Val 


Arg 


Pro Thr 


His Gly Glu Arg Thr Val Phe 


Gly 


130 










135 


140 




Asp Gly 


Asp 


Gly Ser 


Asp 


Leu Ala 


Val His Asp Thr Pro Leu Gly Arg 


145 










150 




155 


160 


Leu 


Gly 


Ala 


Leu 


Ala 


Cys 


Trp Glu 


Asn He Leu Ser Leu Asn Lys 


Tyr 








165 


170 175 




Ala 


Met 


Tyr 


Ser 


Gin 


Asn 


Glu Gin 


Val His Val Ala Ala Trp Pro 


Ser 






180 








185 190 




Phe 


Ser 


val 


Tyr 


Glu 


Pro 


Phe Ala 


His Ala Leu Gly Trp Glu Val 


Asn 






195 








200 


205 




Asn 


Ala 


Val 


Ser 


Lys 


Val 


Tyr Ala 


Val Glu Gly Gly Cys Phe Val 


Leu 
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Z1U 








ax a 


Pro 


l*y£> nia Val VaJL 


Ser 


Glu 


225 










TVit 
1 tlx 


pro 


A on T.mq Hi ^ (51 n 


Leu 


Ala 






245 






lie 


Tyr 


Gly Pro Asp Gly 


Ser 


Pro 






260 






Glu 


Glu 


Gly Leu Leu He 


Ala 


Glu 






275 




280 


Ala 


Lys 


Asn Ala Met Asp 


Pro 


Val 




290 




295 




His 


Arg 


Leu Leu Leu Asn 


Arg Asn 


305 




310 






Ser 


Leu 


Pro He Asp Leu 


Ala 


Glu 



325 





oon 






uiu rice 


Tie* 

lie olu 


nict lieu \*y!> 


Asp 




01 ^ 




240 


nis AJ.a 


oxy wJ-j 


nxa rij.a 


v ai 






£. JJ 




Leu Ala 


Asp Lys 


Leu Pro Glu 


Gly 


265 




270 




He Asp 


Leu Gly 


Leu He Ser 


Leu 






285 




Gly His 


Tyr Ser 


Arg Pro Asp 


Val 




300 






Pro Ala 


Lys Arg 


Val Glu Glu 


Phe 




315 




320 


Thr Thr 


Pro Pro 


He Leu Gly 


Thr 


330 




335 





<210> 305 
<211> 1068 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 305 

atgacgatgc aacatcccaa gtttcgeget gctgccgtgc aggcggcacc ggtcttcctt 60 

gaccttgatg cgtcgataga taaagecate gacctgatcg cgcaagccgc caaaggegge 120 

gcgcaattga ttgcctttcc ggaaacctgg ctgcccggct accccttttt catctggctc 180 

gattegcegg cctggggcat gcaattcatc cagcgctacc atgacaattc cctggtctac 24 0 

ggcacaccgc aggecgageg catcgcgcag getgegaaaa ageategcat catggtcgtc 300 

atggggcaca gcgagcggga tcatggaagt ctgtacatcg ctcagtggat catcggtgcc 360 

gatggggaaa eggttgegae aegtegtaag ctcaaaccga ctcatgccga gcgcaccttg 420 

tttggggaag gcgatggcag tgacctgagc gtgttcgata caccgctggg aagggttggc 480 

geactatget gctgggagca cctccagccc ctgtcgaaat aegegctata cgcacagaat 540 

gagcaggtcc acattgette ctggccaagc ttttccctgt ategeggggg cgcctacgcg 600 

ctcggcgccg aagtgaacaa tgcggccagc cagatttatg ctgtcgaagg ccagtgtttt 660 

gtgatcgege cgtgcggggt cgtcacgaaa gaaatgctgg acgtgctgtg caccgacgaa 720 

atgaaaaagc agttgttggt tgaaggegge gggttcgcgc gaatttaege gcccgatgga 780 

cagatgatgc acgcgccgct ggeggaaaac gaagagggee tggtgtatgc cgatctcgac 84 0 

ctgggcatga tctcgctggc caaagtagtc gccgacccgg cegggcatta tgcgcggccc 900 

gacgtaaccc ggttactgct ggacaagact ccgggagacc gegtcatget ggcgagccgt 960 

cgeggcaagg aggtcagecg cgctggtaac gacgagccgc aagtgctggt ctcgcgcaac 1020 

gaeaegctga cctcgccgaa gccagcgtct tegegcaaag caagctga 1068 



<210> 306 
<211> 355 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 306 

Met Thr Met Gin His Pro Lys Phe 

1 5 
Pro Val Phe Leu Asp Leu Asp Ala 
20 

He Ala Gin Ala Ala Lys Gly Gly 

35 40 
Thr ' Trp Leu Pro Gly Tyr Pro Phe 
50 55 



Arg Ala Ala Ala Val Gin Ala Ala 

10 15 
Ser He Asp Lys Ala He Asp Leu 
25 30 
Ala Gin Leu He Ala Phe Pro Glu 
45 

phe He Trp Leu Asp Ser Pro Ala 
60 
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Trp Gly Met Gin Phe He Gin Arg Tyr His Asp Asn Ser Leu Val Tyr 
65 70 75 80 

Gly Thr Pro Gin Ala Glu Arg He Ala Gin Ala Ala Lys Lys His Arg 

85 90 95 

He Met Val Val Met Gly His Ser Glu Arg Asp His Gly Ser Leu Tyr 

100 105 110 

He Ala Gin Trp He He Gly Ala Asp Gly Glu Thr Val Ala Thr Arg 

115 120 125 

Arg Lys Leu Lys Pro Thr His Ala Glu Arg Thr Leu Phe Gly Glu Gly 

130 135 140 

Asp Glv Ser Asp Leu Ser Val Phe Asp Thr Pro Leu Gly Arg Val Gly 
145 150 155 160 

Ala Leu Cys Cys Trp Glu His Leu Gin Pro Leu Ser Lys Tyr Ala Leu 

165 170 175 

Tvr Ala Gin Asn Glu Gin Val His He Ala Ser Trp Pro Ser Phe Ser 

180 185 190 

Leu Tyr Arg Gly Gly Ala Tyr Ala Leu Gly Ala Glu Val Asn Asn Ala 

195 200 205 

Ala Ser Gin He Tyr Ala Val Glu Gly Gin Cys Phe Val He Ala Pro 

210 215 220 

Cvs Glv Val Val Thr Lys Glu Met Leu Asp Val Leu Cys Thr Asp Glu 
225 230 235 240 

Met Lys Lys Gin Leu Leu Val Glu Gly Gly Gly Phe Ala Arg He Tyr 

245 250 255 

Ala Pro Asp Gly Gin Met Met His Ala Pro Leu Ala Glu Asn Glu Glu 

260 265 270 

Gly Leu Val Tyr Ala Asp Leu Asp Leu Gly Met He Ser Leu Ala Lys 

275 280 285 

Val Val Ala Asp Pro Ala Gly His Tyr Ala Arg Pro Asp Val Thr Arg 
290 295 300 

• Leu Leu Leu Asp Lys Thr Pro Gly Asp Arg Val Met Leu Ala Ser Arg 
305 310 315 320 

Ara Glv Lys Glu Val Ser Arg Ala Gly Asn Asp Glu Pro Gin Val Leu 

325 330 335 

Val Ser Arg Asn Asp Thr Leu Thr Ser Pro Lys Pro Ala Ser Ser Arg 
340 345 350 

Lys Ala Ser 
355 

<210> 307 
<211> 942 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 

<400> 307 ^ vv 

atggctgaac cggaatcctt tatcgtcgct gctgtacagg ctacaccgat ctttcttgac 
cgccaggcaa cccttgagaa agcgtgcgac ctgattgccg aagctggcag caatggcgca 
aaactcgttc tttttcccga agcctttatc cccacctatc ctgattggat atgggcggtg 
acaggctcac aatctgcgct gctcgacgaa ctttatgtgg aactactgga aaactccgtg 
accatccccg acgcgaccac tgagcaactt tgtgaagcag cacgtaacgc cggtctctac 
gtcgtcatgg gagtgaatga gcgcaacgcc gaggcgagca acgccacact ctataacacc 
ctgctctata ttgacgatca gggcaaaatt ctcggcaagc atcgcaaatt ggtcccgacc 
gccctggagc gaatcgtctg gggctatggc gatggcagca cgcttgacgc ctttgaaacg 
ccgctgggca agattggcgg gctgatctgt tgggaaaatt acatgccact ggcgcgccaa 
acactttatg cctggggggt gcaaatttac ttggccgcaa cgtgggatcg cggcgaagtt 
tggcaggcga ccatgcgcca tattgccagg gaaggcggcg tctatgtagt cgcctcctgt 
attccatttc acatcaaaga cattcctgac cacatgcctg aaatccgcaa tctctatgca 
ccggggacag actggatcaa cgtcggccaa agctgcatca tcaaccccag cggcgactat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
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attgcaggcc ctgtcgagtg tcgcgaggag attctttatg ccgaggtaaa tctgcgccag 84 0 
agtgcggcgg caaaacgtat gttggatgtg gcgggccatt atggacgccc tgatgtcttt 900 
cacctcaccg tcaaccgcac gcccaatccg catattcgat aa 942 

<210> 308 
<211> 313 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 308 

Met Ala Glu Pro Glu Ser Phe He Val Ala Ala Val Gin Ala Thr Pro 

I 5 10 15 

He Phe Leu Asp Arg Gin Ala Thr Leu Glu Lys Ala Cys Asp Leu He 

20 25 30 

Ala Glu Ala Gly Ser Asn Gly Ala Lys Leu Val Leu Phe Pro Glu Ala 

35 40 45 

Phe He Pro Thr Tyr Pro Asp Trp He Trp Ala Val Thr Gly Ser Gin 

50 55 60 

Ser Ala Leu Leu Asp Glu Leu Tyr Val Glu Leu Leu Glu Asn Ser Val 
65 70 75 80 

Thr He Pro Asp Ala Thr Thr Glu Gin Leu Cys Glu Ala Ala Arg Asn 

85 90 95 

Ala Gly Leu Tyr Val Val Met Gly Val Asn Glu Arg Asn Ala Glu Ala 

100 105 110 

Ser Asn Ala Thr Leu Tyr Asn Thr Leu Leu Tyr He Asp Asp Gin Gly 

115 120 125 

Lys He Leu Gly Lys His Arg Lys Leu Val Pro Thr Ala Leu Glu Arg 

130 135 140 

He Val Trp Gly Tyr Gly Asp Gly Ser Thr Leu Asp Ala Phe Glu Thr 
145 150 155 160 

Pro Leu Gly Lys He Gly Gly Leu He Cys Trp Glu Asn Tyr Met Pro 

165 170 175 

Leu Ala Arg Gin Thr Leu Tyr Ala Trp Gly Val Gin He Tyr Leu Ala 

180 185 190 

Ala Thr Trp Asp Arg Gly Glu Val Trp Gin Ala Thr Met Arg His He 

195 200 205 

Ala Arg Glu Gly Gly Val Tyr Val Val Ala Ser Cys He Pro Phe His 

210 215 220 

He Lys Asp He Pro Asp His Met Pro Glu He Arg Asn Leu Tyr Ala 
225 230 235 240 

Pro Gly Thr Asp Trp He Asn Val Gly Gin Ser Cys He He Asn Pro 

245 250 255 

Ser Gly Asp Tyr He Ala Gly Pro Val Glu Cys Arg Glu Glu He Leu 

260 265 270 

Tyr Ala Glu Val Asn Leu Arg Gin Ser Ala Ala Ala Lys Arg Met Leu 

275 280 285 

Asp Val Ala Gly His Tyr Gly Arg Pro Asp Val Phe His Leu Thr Val 

290 295 300 

Asn Arg Thr Pro Asn Pro His He Arg 
305 310 

<210> 309 
<211> 951 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
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<400> 309 

ttgaccgact gtctcaaaat agccgccgcc caaatcactc cggtcttcct cgaccgcgtt 60 

gcgaccacga agaaggtcgt cgaaaccatc gaaaaagcgg ccgccggcgg tgcgcggctg 120 

gtcgcattcg gcgaagcgct gttgcccgcc tatccattgt ggctgacgcg caccgacgcc 180 

gcgcggttca attccgacgt gcaaaaaaac ttgcacgcga tctatctcaa gcaatccgtc 240 

tcgatagcag gcggtcacct atctccgatt tgcaaaatcg caagcgaacg caagattgcc 300 

gtcatcctcg gcatcgccga gcgcgcgacc gaccggggcg accacaccat ttactgctcg 360 

tgcgtgttca tcgatgccga cggccgaatc gcgtcggtcc atcgcaagct gatgccgaca 420 

tacgaagaac gcctcagttg gggattcggc gacggtgcgg gactcgtcac gcatccggtc 480 

gggccgttca cggtgggcgc gttgaactgc tgggaaaact ggatgcccct cgcgcgcacc 540 

gcgctgtatg ccggcggaga agatttgcac gttgcgatct ggcccggcgg atcggtgctc 600 

acggaagaca tcacgcgctt catcgcacgc gagtcgcgct cgttcgtcct gtccgtcagc 660 

ggcatcattc gcgaaagcga catccccagc ggggtcccct atcgcgatga aatgtgtgcg 720 

aaaggcgaaa ccatctacaa cggcggaagc tgcatcgccg gacccgacgg tcagtggatc 780 

atcgcgcccg taaccgaccg tgaagagttg atcttcgcgg agatcgacca cgaacacgtc 840 

cgccgcgagc ggcagaattt cgacccggcc gggcattacg cgcggccc$a tgtgttgcaa 900 

ataaccgttg atcgtcgacg acaaacagcg gcgaatttta ttgatgacta a 951 

<210> 310 
<211> 316 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 310 

Leu Thr Asp Cys Leu Lys He Ala Ala Ala Gin He Thr Pro Val Phe 

15 10 15 

Leu Asp Arg Val Ala Thr Thr Lys Lys Val Val Glu Thr He Glu Lys 

20 25 30 

Ala Ala Ala Gly Gly Ala Arg Leu Val Ala Phe Gly Glu Ala Leu Leu 

35 40 45 

Pro Ala Tyr Pro Leu Trp Leu Thr Arg Thr Asp Ala Ala Arg Phe Asn 

50 55 60 

Ser Asp Val Gin Lys Asn Leu His Ala He Tyr Leu Lys Gin Ser Val 
65 70 75 80 

Ser He Ala Gly Gly His Leu Ser Pro He Cys Lys He Ala Ser Glu 

85 90 95 

Arg Lys He Ala Val He Leu Gly He Ala Glu Arg Ala Thr Asp Arg 

100 105 HO 

Gly Asp His Thr He Tyr Cys Ser Cys Val Phe He Asp Ala Asp Gly 

115 120 125 

Arg He Ala Ser Val His Arg Lys Leu Met Pro Thr Tyr Glu Glu Arg 

130 135 140 

Leu Ser Trp Gly Phe Gly Asp Gly Ala Gly Leu Val Thr His Pro Val 
145 150 155 160 

Gly Pro Phe Thr Val Gly Ala Leu Asn Cys Trp Glu Asn Trp Met Pro 

165 170 175 

Leu Ala Arg Thr Ala Leu Tyr Ala Gly Gly Glu Asp Leu His Val Ala 

180 185 190 

He Trp Pro Gly Gly Ser Val Leu Thr Glu Asp He Thr Arg Phe He 

195 200 205 

Ala Arg Glu Ser Arg Ser Phe Val Leu Ser Val Ser Gly He He Arg 

210 215 220 

Glu Ser Asp He Pro Ser Gly Val Pro Tyr Arg Asp Glu Met Cys Ala 
225 230 235 240 

Lys Gly Glu Thr He Tyr Asn Gly Gly Ser Cys He Ala Gly Pro Asp 

■ 245 250 255 

Gly Gin Trp He He Ala Pro Val Thr Asp Arg Glu Glu Leu He Phe 
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260 265 270 

Ala Glu lie Asp His Glu His Val Arg Arg Glu Arg Gin Asn Phe Asp 

275 280 285 

Pro Ala Gly His Tyr Ala Arg Pro Asp Val Leu Gin lie Thr Val Asp 

290 295 300 

Arg Arg Arg Gin Thr Ala Ala Asn Phe lie Asp Asp 
305 310 315 



<210> 311 
<211> 1011 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 
<400> 311 

atgtcagaaa agcgaataat cagagcagct gcagttcaga tcacacctga atttgactca 60 

gcagatggaa cagttaagaa ggtatgcaag gtaatcgatg aagcaggcgc aaagggtgta 120 

caaattatag tattccccga aaccttcatt ccgtattacc catacttctc attcattacc 180 

cccccagttt ctgctggcgc tgagcatttg aagctttatg aaaaaagtgt cgtgatacct 240 

ggtcctgtca ctcaagcgat cgccgagaga gctagagtga atcaaatggt cgtcgtactc 300 

ggtgtaaacg agagagataa cggtagcctc tataacactc aactgatctt cgataccaat 360 

ggtgagttga tgttgaagag aagaaaaatc actcctacat atcatgagcg catgatctgg 420 

ggacaaggtg atgcttcagg cttaaaagta gttgaaacga gcattgcccg ggtaggtgct 480 

ctagcttgct gggaacatta caacccgctg gccagatatt ctctcatgac acagcatgaa 540 

gaaattcact gcgcacaatt cccaggttct atggttggcc aaatatttgc cgaccaaatg 600 

gatgtcacta tcagacatca cgcattggaa tctggctgtt tcgtcattaa tgccaccggc 660 

tggctcacag acgagcaaat ccagtccatt acagatgacc caaaaatgca gaaagcatta 720 

cgtggcggct gcaacacagc aatcatttct cccgaaggtg tgcacttaac agagccctta 780 

cgtgaaggtg aaggcatttt gattgctgac ctggacatgt cactcatcac aaaacgaaaa 840 

agaatgatgg attcagtagg tcattattca agacctgaac tattaagtct ggcgatcaat 900 

gacaagccag caacaacaaa attttcaatg actgaggggt gtactcaaac tgagcaattt 960 

cgaatcgcag aggagttgaa aaatgacgac aagcttagca ccggaaacta a 1011 

<210> 312 
<211> 336 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 312 



Met 


Ser 


Glu 


Lys 


Arg 


He 


He 


Arg 


Ala Ala 


Ala Val 


Gin 


He 


Thr 


Pro 


1 






5 








10 








15 




Glu 


Phe 


Asp 


Ser 


Ala 


Asp 


Gly 


Thr 


Val Lys 


Lys Val 


Cys 


Lys 


Val 


He 






20 










25 






30 






Asp 


Glu 


Ala 


Gly 


Ala 


Lys 


Gly 


Val 


Gin He 


He Val 


Phe 


Pro 


Glu 


Thr 




35 










40 






45 








Phe 


He 


Pro 


Tyr 


Tyr 


Pro 


Tyr 


Phe 


Ser Phe 


He Thr 


Pro 


Pro 


Val 


Ser 




50 






55 






60 










Ala 


Gly 


Ala 


Glu 


His 


Leu 


Lys 


Leu 


Tyr Glu 


Lys Ser 


Val 


Val 


He 


Pro 


65 








70 






75 






Gin 


80 


Gly 


Pro 


Val 


Thr 


Gin 


Ala 


lie 


Ala 


Glu Arg 


Ala Arg 


Val 


Asn 


Met 








85 








90 








95 




Val 


Val 


Val 


Leu 


Gly Val 


Asn 


Glu 


Arg Asp 


Asn Gly 


Ser 


Leu 


Tyr 


Asn 








100 










105 






110 






Thr 


Gin 


Leu 


He 


Phe 


Asp 


Thr 


Asn 


Gly Glu 


Leu Met 


Leu 


Lys 


Arg 


Arg 






115 








120 






125 








Lys 


He 


Thr 


Pro 


Thr 


Tyr 


His 


Glu 


Arg Met 


He Trp 


Gly 


Gin 


Gly 


Asp 
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130 135 140 

Ala Ser Gly Leu Lys Val Val Glu Thr Ser He Ala Arg Val Gly Ala 
145 150 155 160 

Leu Ala Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ser Leu Met 

165 170 175 

Thr Gin His Glu Glu He His Cys Ala Gin Phe Pro Gly Ser Met Val 

180 185 190 

Gly Gin He Phe Ala Asp Gin Met Asp Val Thr He Arg His His Ala 

195 200 205 

Leu Glu Ser Gly Cys Phe Val lie Asn Ala Thr Gly Trp Leu Thr Asp 

210 215 220 

Glu Gin He Gin Ser He Thr Asp Asp Pro Lys Met Gin Lys Ala Leu 
225 230 235 240 

Arg Gly Gly Cys Asn Thr Ala He He Ser Pro Glu Gly Val His Leu 

245 250 255 

Thr Glu Pro Leu Arg Glu Gly Glu Gly He Leu He Ala Asp Leu Asp 

260 265 270 

Met Ser Leu He Thr Lys Arg Lys Arg Met Met Asp Ser Val Gly His 

275 280 285 

Tyr Ser Arg Pro Glu Leu Leu Ser Leu Ala He Asn Asp Lys Pro Ala 

290 295 300 

Thr Thr Lys Phe Ser Met Thr Glu Gly Cys Thr Gin Thr Glu Gin Phe 
305 310 315 320 

Arg He Ala Glu Glu Leu Lys Asn Asp Asp Lys Leu Ser Thr Gly Asn 
325 330 335 



<210> 313 
<211> 987 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 313 

atggcggccg ttcaggctgc gccggttccg tttgatgctg aggcttcggt ggataaggcc 60 

tgtcgcttaa ttcaagaagc tgcagccaaa ggcgcagata tagttgcttt cggtgaggca 120 

tggctacccg gctaccccta ttttgcctgg ttaccccaag taacaccaga gtggtatagt 180 

gcggctgccg attatcttgc cagctctgtt gatatccctg gcccgatcac cgataaactt 240 

tgccaggctg cccgtcgtgc atcggttgaa ctcatcatgg gtgtggtaga acgcagcaag 300 

tctcagggaa ccacctattg cacgcttctt tttattagca aggatggcga aataattggc 360 

aagcaccgca agctgaagcc cacactcgcc gaacgaaccg tctggggtga aggcgatgcc 420 

agcggactga gggttcacga tcggcctatt gccagaatca gtgggctctc ctgctgggag 480 

aacaaaatga tgctgccagg ttacgcactg atggcgcagg gtacgcaggt gcatgtctcc 540 

gcctggccag ggatcccaga ggattcaccc atggaggtgc ctgcacaccc ccgtcaaaag 600 

ctgctttccc aagcctttgc actgcaaggc gggtgctatg ttatttctcc ctccattgtc 660 

cttagggcag aagatgtgcc cgagaaacac gccgctctac tgatgggaga ccaagtgggc 720 

ggtagctata tcattgaccc ctgcggaaaa gtgatcgccg aggccggtgc gggtgagact 780 

atcctgattg ccaaaggcaa ccttgacctc gtaagggccg ccaaaatggc cagtgatgta 840 

ggggggtctt attcacgccc ggatcttttg cagttgatga tcaataaccg accactcgaa 900 

cagctgattg agttcagtgc tgaaggtgca ggtaggggga atctagtatc caactcaccc 960 

gaggtgtcag aacaagaagg tgagtaa 987 



<210> 314 
<211> 328 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 
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<400> 314 

Met Ala Ala Val Gin Ala Ala Pro Val Pro Phe Asp Ala Glu Ala Ser 

15 10 15 

Val Asp Lys Ala Cys Arg Leu He Gin Glu Ala Ala Ala Lys Gly Ala 

20 25 30 

Asp He Val Ala Phe Gly Glu Ala Trp Leu Pro Gly Tyr Pro Tyr Phe 

35 40 45 

Ala Trp Leu Pro Gin Val Thr Pro Glu Trp Tyr Ser Ala Ala Ala Asp 

50 55 60 

Tyr Leu Ala Ser Ser Val Asp He Pro Gly Pro He Thr Asp Lys Leu 
65 70 75 80 

Cys Gin Ala Ala Arg Arg Ala Ser Val Glu Leu He Met Gly Val Val 

85 90 95 

Glu Arg Ser Lys Ser Gin Gly Thr Thr Tyr Cys Thr Leu Leu Phe He 

100 105 110 

Ser Lys Asp Gly Glu He He Gly Lys His Arg Lys Leu Lys Pro Thr 

115 120 125 

Leu Ala Glu Arg Thr Val Trp Gly Glu Gly Asp Ala Ser Gly Leu Arg 

130 135 140 

Val His A3p Arg Pro He Ala Arg He Ser Gly Leu Ser Cys Trp Glu 
145 150 155 160 

Asn Lys Met Met Leu Pro Gly Tyr Ala Leu Met Ala Gin Gly Thr Gin 

165 170 175 

Val His Val Ser Ala Trp Pro Gly He Pro Glu Asp Ser Pro Met Glu 

180 185 190 

Val Pro Ala His Pro Arg Gin Lys Leu Leu Ser Gin Ala Phe Ala Leu 

195 200 205 

Gin Gly Gly Cys Tyr Val He Ser Pro Ser He Val Leu Arg Ala Glu 

210 215 220 

Asp Val Pro Glu Lys His Ala Ala Leu Leu Met Gly Asp Gin Val Gly 
225 230 235 240 

Gly Ser Tyr He He Asp Pro Cys Gly Lys Val He Ala Glu Ala Gly 

245 250 255 

Ala Gly Glu Thr He Leu He Ala Lys Gly Asn Leu Asp Leu Val Arg 

260 265 270 

Ala Ala Lys Met Ala Ser Asp Val Gly Gly Ser Tyr Ser Arg Pro Asp 

275 280 285 

Leu Leu Gin Leu Met He Asn Asn Arg Pro Leu Glu Gin Leu He Glu 

290 295 300 

Phe Ser Ala Glu Gly Ala Gly Arg Gly Asn Leu Val Ser Asn Ser Pro 
305 310 315 320 

Glu Val Ser Glu Gin Glu Gly Glu 
325 

<210> 315 
<211> 960 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 315 

atgacaatgc ttaagaccaa attccgggtt gctgcggtgc aggcagcgcc tgtattcctg 60 

gatcgagagg ctacgctgga gaaagcctgc ggactgattg aggaggcggg tcgcaacggg 120 

gccagcctgg tggtctttcc tgagtcgttc attccggcct atcccgattg ggtctgggct 180 

gtgccggcgg gtgaagaagc tttacttaat gaactgtatg ctcaactgct ggccaacgcc 24 0 

gttgaaattc ccagcccggc caccgaacgc ttgagccagg cagcgaaaaa ggctaaagtc 300 

catgtggtta tgggcctgac cgaacgcaac agcgaggcca gcggcggcag cctctacaat 360 

accttgctct atcttgaccc acagggccaa attctgggca aacatcgcaa gctggtgccc 420 

accggcggcg agcggctggt ttgggcccag ggcgacggca gtaccctgca agtctacgag 4 80 
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900 
960 



acccccttgg gtaaactcag cggcttgatt tgctgggaaa attatatgcc gctggcccgc 540 

tacgcgctct atgcctgggg tacgcaaatc tatatcgccg ccacctggga tcgaggcgag 600 

ccgtggcttt cgacgctgcg gcatatcgcc aaagagggcc gggtgtttgt catcggctgt 660 

ggcatggcct tgcgtaaggc tgatattccc gaccattttg aattcaagca gcgcttttat 720 

caaaatgccg gcgagtggat caatggaggc gacagcgcca ttgtcaatcc tgagggtgaa 780 

tttattgctg gacccctgag cgagcaggaa ggtattttgt acgccgagat tgatcctggc 840 
cagatggccg gaccaaaatg gatgctcgat gtggccgggc actatgctcg cccggatgtc 
tttgaactga ccgtccagac cgtagctcgg cccatgatta cctctactca gccgcgatag 

<210> 316 
<211> 319 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 316 

Met Thr Met Leu Lys Thr Lys Phe Arg Val Ala Ala Val Gin Ala Ala 

X 5 10 15 

Pro Val Phe Leu Asp Arg Glu Ala Thr Leu Glu Lys Ala Cys Gly Leu 

20 25 30 

lie Glu Glu Ala Gly Arg Asn Gly Ala Ser Leu Val Val Phe Pro Glu 

35 40 45 

Ser Phe lie Pro Ala Tyr Pro Asp Trp Val Trp Ala Val Pro Ala Gly 

50 55 60 

Glu Glu Ala Leu Leu Asn Glu Leu Tyr Ala Gin Leu Leu Ala Asn Ala 
65 70 75 80 

Val Glu He Pro Ser Pro Ala Thr Glu Arg Leu Ser Gin Ala Ala Lys 

85 90 95 

Lys Ala Lys Val His Val Val Met Gly Leu Thr Glu Arg Asn Ser Glu 

100 105 HO 

Ala Ser Gly Gly Ser Leu Tyr Asn Thr Leu Leu Tyr Leu Asp Pro Gin 

115 120 125 

Gly Gin He Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly Gly Glu 

130 135 140 

Arg Leu Val Trp Ala Gin Gly Asp Gly Ser Thr Leu Gin Val Tyr Glu 
145 150 155 160 

Thr Pro Leu Gly Lys Leu Ser Gly Leu He Cys Trp Glu Asn Tyr Met 

165 170 175 

Pro Leu Ala Arg Tyr Ala Leu Tyr Ala Trp Gly Thr Gin He Tyr He 

180 185 190 

Ala Ala Thr Trp Asp Arg Gly Glu Pro Trp Leu Ser Thr Leu Arg His 

195 200 205 

He Ala Lys Glu Gly Arg Val Phe Val He Gly Cys Gly Met Ala Leu 

210 215 220 

Arg Lys Ala Asp He Pro Asp His Phe Glu Phe Lys Gin Arg Phe Tyr 
225 230 235 240 

Gin Asn Ala Gly Glu Trp He Asn Gly Gly Asp Ser Ala He Val Asn 

245 250 255 

Pro Glu Gly Glu Phe He Ala Gly Pro Leu Ser Glu Gin Glu Gly He 

260 265 ' 270 

Leu Tyr Ala Glu He Asp Pro Gly Gin Met Ala Gly Pro Lys Trp Met 

275 280 285 

Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe Glu Leu Thr 

290 295 300 

Val Gin Thr Val Ala Arg Pro Met He Thr Ser Thr Gin Pro Arg 
305 310 315 

<210> 317 
<211> 993 
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<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 317 

atgaccattg taaaagccgc 
accgtggaca aggttgtcca 
actttcccgg agacggtggt 
gccctcaagg tgggcaagga 
gccaccacgc tcgccatcgg 
gtcaacgaac gcgacggcag 
accttgattc agcgccgccg 
cagggcgacg gctccgggct 
gcgtgctggg agcactacaa 
atccactcgg cgatgtaccc 
gtcaacatcc gccagcacgc 
ctggacgccg atcagcaggc 
tccagtggtt gcttcaccgc 
cggtccggtg agggcgaggt 
cagatgatgg attcacgcgg 
cgcacggcga ccgcgcatgt 
gcggacggtc catccgccgt 

<210> 318 
<211> 330 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 318 
















Met Thr 


lie Val 


Lys 


Ala 


Ala 


Ala Val 


Gin 


He Ser Pro Val Leu 


Tyr 


1 




5 








10 


15 




Ser Arg 


Glu Gly 


Thr 


Val 


Asp 


Lys Val 


Val 


Gin Lys He Leu Glu 


Leu 


20 








25 




30 




Gly Lys 


Gin Gly 


Val 


Gin 


Phe 


Ala Thr 


Phe 


Pro Glu Thr Val Val 


Pro 


35 








40 




45 




Tyr Tyr 


Pro Tyr 


Phe 


Ser 


Phe 


Val Gin 


Ser 


Gly Tyr Ala Leu Lys 


Val 


50 








55 






60 




Gly Lys 


Glu His 


Leu Arg 


Leu 


Leu Glu 


Gin 


Ser Val Thr Val Pro 


Ser 


65 






70 








75 


80 


Ala Thr 


Thr Leu 


Ala 


He 


Gly 


Glu Ala 


Cys 


Lys Gin Ala Gly Met 


Val 






85 






90 


95 




Val Ser 


He Gly 


Val 


Asn 


Glu 


Arg Asp Gly 


Ser Thr He Tyr Asn 


Thr 




100 








105 




110 




Gin Leu 


Leu Phe 


Asp Ala 


Asp 


Gly Thr Leu 


He Gin Arg Arg Arg Lys 




115 








120 




125 




lie Ser 


Pro Thr 


Phe 


His 


Glu 


Arg Met 


Val 


Trp Gly Gin Gly Asp Gly 


130 








135 






140 




Ser Gly 


Leu Arg 


Ala 


Val 


Asp 


Ser Ala 


Val 


Gly Arg He Gly Gin 


Leu 


145 




150 






155 


160 


Ala Cys 


Trp Glu 


His 


Tyr 


Asn 


Pro Leu 


Ala 


Arg Tyr Ala Met Met 


Ala 


165 






170 


175 




Asp Gly 


Glu Gin 


He 


His 


Ser 


Ala Met 


Tyr 


Pro Gly Ser Phe Ala 


Gly 


180 








185 




190 




Asp Ala 


Phe Ser 


Glu 


Gin 


He 


Gin Val 


Asn 


He Arg Gin His Ala 


Leu 


195 








200 




205 




Glu Ala 


Gly Cys 


Phe 


Val 


Val 


Asn Ala 


Thr 


Ala Trp Leu Asp Ala Asp 



tgccgtccaa ataagccctg tcctttacag ccgcgaaggc 60 

gaagatcctc gaactcggca agcaaggcgt ccagttcgcc 120 

cccctactac ccctacttct ccttcgtcca gtcgggctac 180 

acatctgcgc ttgctcgaac agtcggtcac cgtgccatcg 240 

cgaagcctgc aagcaggcgg ggatggtggt gtccatcggc 300 

cacgatctac aacacgcagc tgctcttcga tgccgacggc 360 

aaagatcagc ccgaccttcc atgaacgcat ggtctggggc 420 

gcgcgcggtc gacagcgcgg tcgggcgcat cggccagttg 480 

cccgctggcc cgctacgcca tgatggccga cggcgagcag 540 

cggttccttc gcaggcgacg ccttctccga acagatccag 600 

attggaagcc ggctgcttcg tcgtgaacgc caccgcgtgg 660 

gcagatcatg caggacaccg gttgcgccat cggcccgatc 720 

catcgtttcg ccggacggcg tgttgctggg cgagcctctg 780 

gattgccgat ctcgacttca cgctgatcga caagcgcaag 840 

gcactatgcg cgcccggaat tgctcagcct gttgatcgac 900 

gcatgagcgc agcgcgcatc cgaaggcgac tgccgagcag 960 

gaatgcgcag taa 993 
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210 215 



Uj.il OJ.Il 


Ala 


Gin 


He 


Met 


Gin Asp 


225 








230 






Ser Ser 


uiy 


Cys 


Phe 
245 


i nr 


Ala 


He 


Gly Glu 


Pro 


Leu Arg 


Ser 


Gly Glu 






260 










Phe Thr 


Leu 


lie Asp 


Lys 


Arg 


Lys 




275 










280 


Tyr Ala 


Arg 


Pro 


Glu 


Leu 


Leu 


Ser 


290 










295 




Ala His 


Val 


His 


Glu 


Arg 


Ser 


Ala 


305 








310 






Ala Asp 


Gly 


Pro 


Ser 
325 


Ala 


Val 


Asn 









220 










Thr 


Gly 


Cys 


Ala 


He 


Gly Pro 


He 






235 










240 


Val 


Ser 


Pro 


Asp 


Gly Val 


Leu 


Leu 




250 










255 




Gly Glu 


Val 


He 


Ala 


Asp 


Leu 


Asp 


265 










270 






Gin 


Met 


Met 


Asp 


Ser 
285 


Arg 


Gly 


His 


Leu 


Leu 


He 


Asp 
300 


Arg 


Thr 


Ala 


Thr 


His 


Pro 


Lys 
315 


Ala 


Thr 


Ala 


Glu 


Gin 
320 


Ala 


Gin 
330 















<210> 319 
<211> 1017 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 319 

atgacgaccc cgcgtatcgt ccgcgtcgcc gcggtgcaga tggcgcccga cctggaatcc 60 

gcacacggca cggtagacaa ggtctgccgc gccattctgg aggccggcga aaaaggcgcg 120 

cgcatggtgg tctttcccga gaccttcgtg ccctactacc cctacttctc cttcatccag 180 

ccggcggtga cgatgggcgc cgcccacctc gagctctacg agcgcgcggt gacggtgccc 240 

ggcccggtca ccctggccgt gggcgaagcg gcgcggcggg cgggcgcggt cgtcgtgctc 300 

ggcgtcaatg aacgcgacca cggctcgctc tacaacaccc agctgatctt cgacgagacc 360 

ggcgcgctgg tgctcaagcg ccgcaagctc accccgacct atcacgagcg catggtgtgg 420 

ggccagggcg atggcagcgg tctcaaggtg gtggacaccg gcatcggccg catcggcgcg 480 

ctggcctgct gggagcacta caacccgctc gcccgctaca ccctgatggc gcagcatgaa 540 

gagatccacg ccgcgcagtt ccccggttcc atggtcggcc agatctttgc cgaccagatg 600 

gcggtgacca tccgccacca cgcgctggaa tccggctgct tcgtggtgaa cgccaccggc 660 

tggctcaccg atgcgcagat caccgccatc acccccgacc cggccatgca gcgcgcgctg 720 

cgcggcggct gccacaccgc catcgtgtcg ccggaaggca gctacgtctg cgaaccgctc 780 

accgagggcg aaggcatgct ggtggccgac ctcgacatgc ggctggtcac caagcgcaag 840 

cggatgatgg attcggtcgg ccactacgcc cggccggagc tgctttcgct caacgccgat 900 

cttgccccca agccggcgct gcacacacag ccggctgcgt ccctccctct ctcccttcag 960 

gcaggagccg accatgtcga cgacgaccgc accgcgtccg caacagctga tctctga 1017 



<210> 320 
<211> 338 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 320 

Met Thr Thr Pro Arg He Val Arg 

1 5 
Asp Leu Glu Ser Ala His Gly Thr 
20 

Leu Glu Ala Gly Glu Lys Gly Ala 

35 40 
Phe Val Pro Tyr Tyr Pro Tyr Phe 

50 55 
Met Gly Ala Ala His Leu Glu Leu 



Val Ala Ala Val Gin Met Ala Pro 

10 15 
Val Asp Lys Val Cys Arg Ala He 
25 30 
Arg Met Val Val Phe Pro Glu Thr 
45 

Ser Phe He Gin Pro Ala Val Thr 
60 

Tyr Glu Arg Ala Val Thr Val Pro 
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65 70 75 80 

Gly Pro Val Thr Leu Ala Val Gly Glu Ala Ala Arg Arg Ala Gly Ala 

85 90 95 

Val Val Val Leu Gly Val Asn Glu Arg Asp His Gly Ser Leu Tyr Asn 

100 105 110 

Thr Gin Leu lie Phe Asp Glu Thr Gly Ala Leu Val Leu Lys Arg Arg 

115 120 125 

Lys Leu Thr Pro Thr Tyr His Glu Arg Met Val Trp Gly Gin Gly Asp 

130 135 140 

Gly Ser Gly Leu Lys Val Val Asp Thr Gly lie Gly Arg lie Gly Ala 
145 150 155 160 

Leu Ala Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Thr Leu Met 

165 170 175 

Ala Gin His Glu Glu lie His Ala Ala Gin Phe Pro Gly Ser Met Val 

180 185 190 

Gly Gin lie Phe Ala Asp Gin Met Ala Val Thr He Arg His His Ala 

195 200 205 

Leu Glu Ser Gly Cys Phe Val Val Asn Ala Thr Gly Trp Leu Thr Asp 

210 215 220 

Ala Gin He Thr Ala He Thr Pro Asp Pro Ala Met Gin Arg Ala Leu 
225 230 235 240 

Arg Gly Gly Cys His Thr Ala He Val Ser Pro Glu Gly Ser Tyr Val 

245 250 255 

Cys Glu Pro Leu Thr Glu Gly Glu Gly Met Leu Val Ala Asp Leu Asp 

260 265 270 

Met Arg Leu Val Thr Lys Arg Lys Arg Met Met Asp Ser Val Gly His 

275 280 285 

Tyr Ala Arg Pro Glu Leu Leu Ser Leu Asn Ala Asp Leu Ala Pro Lys 

290 295 300 

Pro Ala Leu His Thr Gin Pro Ala Ala Ser Leu Pro Leu Ser Leu Gin 
305 310 315 320 

Ala Gly Ala Asp His Val Asp Asp Asp Arg Thr Ala Ser Ala Thr Ala 
325 330 335 

Asp Leu 



<210> 321 
<211> 993 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 

<400> 321 cn 

atgagagttg tcaaagccgc tgctgtccaa ctgagtcctg tcctccatag ccgcgacgga bU 

acggtcgaaa aggtcgtgcg gaagatccat gaactcgccg aagagggagt cgagttcgcc 120 

acctttcctg agaccgtggt gccttactac ccgtactttt ccttcgttca gacgcccttg 180 

cagcaaatct tcggaactga gtatctgagg ctgctcgacc aggcagtcac cgtgccatcc 240 

gctgccaccg acgcgatcgg cgaggctgcc aggtgggctg gacttgttgt ctcgatcggc 300 

gtcaacgagc gagacggggg aactctctac aacactcagc ttctcttcga tgccgacggc 360 

agcttaattc agcggcgtcg caagatcaca cccacccatt acgagcgcat gatctggggc 420 

cagggcgacg gctcaggtct gcgggccgtt gatagcaagg ccggccgcat tggtcagctg 480 

gcatgctggg aacacaacaa ccccctggcg cgctacgcgc tgatggccga cggcgagcag 540 

atccattcgg ccatgtatcc gggctccatg ttcggcgact cgttttccca aaagaccgaa 600 

atcaatatcc ggcagcatgc gctggaatct gcgtgcttcg tcgtgaacgc aacggcctgg 660 

ctggacgccg atcagcaggc gcaaatcatg aaggacaccg gctgcggcat cggcccgatc 720 

tccggcggtt gcttcactgc gatcgttgca cccgatggta gcctgctggg cgaacccatc 780 

cgttccggtg agggcgtcgt cgtcgccaac ctcgacttca cgctgatcga caggcgtaag Q * n 
caggtgatgg actcgcgagg ccactacagc cggccggagt tgctcagcct cttaatagac 
cgcactccta ccgcgcacgt tcacgaacgc gctacgcacc ccacgacagg agctgagcaa 



840 
900 
960 
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ggctccgagg atgtgttcga ggctcgcatt taa 993 

<210> 322 
<211> 330 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 322 

Met Arg Val Val Lys Ala Ala Ala Val Gin Leu Ser Pro Val Leu His 

15 10 15 

Ser Arg Asp Gly Thr Val Glu Lys Val Val Arg Lys lie His Glu Leu 

20 25 30 

Ala Glu Glu Gly Val Glu Phe Ala Thr Phe Pro Glu Thr Val Val Pro 

35 40 45 

Tyr Tyr Pro Tyr Phe Ser Phe Val Gin Thr Pro Leu Gin Gin lie Phe 

50 55 60 

Gly Thr Glu Tyr Leu Arg Leu Leu Asp Gin Ala Val Thr Val Pro Ser 
65 70 75 80 

Ala Ala Thr Asp Ala lie Gly Glu Ala Ala Arg Trp Ala Gly Leu Val 

85 90 95 

Val Ser lie Gly Val Asn Glu Arg Asp Gly Gly Thr Leu Tyr Asn Thr 

100 105 110 

Gin Leu Leu Phe Asp Ala Asp Gly Ser Leu lie Gin Arg Arg Arg Lys 

115 120 125 

lie Thr Pro Thr His Tyr Glu Arg Met He Trp Gly Gin Gly Asp Gly 

130 135 140 

Ser Gly Leu Arg Ala Val Asp Ser Lys Ala Gly Arg He Gly Gin Leu 
145 150 155 160 

Ala Cys Trp Glu His Asn Asn Pro Leu Ala Arg Tyr Ma Leu Met Ala 

165 170 175 

Asp Gly Glu Gin He His Ser Ala' Met Tyr Pro Gly Ser Met Phe Gly 

180 185 190 

Asp Ser Phe Ser Gin Lys Thr Glu He Asn He Arg Gin His Ala Leu 

195 200 205 

Glu Ser Ala Cys Phe Val Val Asn Ala Thr Ala Trp Leu Asp Ala Asp 

210 215 220 

Gin Gin Ala Gin He Met Lys Asp Thr Gly Cys Gly He Gly Pro He 
225 230 235 240 

Ser Gly Gly Cys Phe Thr Ala He Val Ala Pro Asp Gly Ser Leu Leu 

245 250 255 

Gly Glu Pro He Arg Ser Gly Glu Gly Val Val Val Ala Asn Leu Asp 

260 265 270 

Phe Thr Leu He Asp Arg Arg Lys Gin Val Met Asp Ser Arg Gly His 

275 280 285 

Tyr Ser Arg Pro Glu Leu Leu Ser Leu Leu He Asp Arg Thr Pro Thr 

290 295 300 

Ala His Val His Glu Arg Ala Thr His Pro Thr Thr Gly Ala Glu Gin 
305 310 315 320 

Gly Ser Glu Asp Val Phe Glu Ala Arg He 
325 330 

<210> 323 
<211> 951 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
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<400> 323 

atgagcaacg tgaaagtcgc ggtcgtgcaa cgcgcgccgg tgttcggcaa ccgcgccgcc 60 

accctcgatc gcgccgtcgc cgcgctggcc gaagcggcgc agcagggggc gaagctggtc 120 

gtgatgcccg agcacttcat ccccggctac ccggcctgga tctggcgcct gcgtccgggc 180 

accgacctgc gattgtgcga acagctgcac gcgatgctgc gcgccaacgc cgtgaggctg 240 

gatgacggtg acctggcccc gttgaccgag gccgcgcagc ggcatgcgct caccgtggtc 300 

tgcggcgtct gcgagatcga caccgaattc agtcgcggca ccctgtacaa caccgtggtc 360 

gtgatcgggc ccgacggcac gctgctcaac cggcatcgca agctgatgcc caccaacccc 420 

gagcgcatgg tctggggcat gggcgacgcc acggggctga aggtggtcga cacgccctgc 480 

gggcgcatcg gcacgctgat ttgctgggag aactacatgc cattcgcacg cgccgcgctg 540 

tacgcgcagg gggtcgaggt cctggttgca ccgacctacg acgaaggccc ggtatggctg 600 

gcgtcgatgc agcacatcgc ccgcgaaggc ggctgctggg tggtgggcaa cggctgcgca 660 

ttccagggcc gcgacatgcc ggacaccttg ccgggcaagg cccagctgtt tcccgaggcc 720 

gacgcctggg tcaacgccgg ggactcggtc atcgtcgcgc caggcggccg gacagtggcg 780 

ggtccgttgc acgaggcgtt cgggctgttc accgccgaga tcgacctctc ccgggtcgga 840 

atggcccggc gcagcctgga tgtggccggg cactatggac ggcccgacat cttctgcctg 900 

caggtcaacg cccgggcgca gccgccggtt gaggtgacgc accatggctg a 951 

<210> 324 
<211> 316 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 324 

Met Ser Asn Val Lys Val Ala Val Val Gin Arg Ala Pro Val Phe Gly 

15 10 15 

Asn Arg Ala Ala Thr Leu Asp Arg Ala Val Ala Ala Leu Ala Glu Ala 

20 25 30 

Ala Gin Gin Gly Ala Lys Leu Val Val Met Pro Glu His Phe lie Pro 

35 40 45 

Gly Tyr Pro Ala Trp He Trp Arg Leu Arg Pro Gly Thr Asp Leu Arg 

50 55 60 

Leu Cys Glu Gin Leu His Ala Met Leu Arg Ala Asn Ala Val Arg Leu 
65 70 75 80 

Asp Asp Gly Asp Leu Ala Pro Leu Thr Glu Ala Ala Gin Arg His Ala 

85 90 95 

Leu Thr Val Val Cys Gly Val Cys Glu He Asp Thr Glu Phe Ser Arg 

100 105 HO 

Gly Thr Leu Tyr Asn Thr Val Val Val He Gly Pro Asp Gly Thr Leu 

115 120 125 

Leu Asn Arg His Arg Lys Leu Met Pro Thr Asn Pro Glu Arg Met Val 

130 135 140 

Trp Gly Met Gly Asp Ala Thr Gly Leu Lys Val Val Asp Thr Pro Cys 
145 150 155 160 

Gly Arg lie Gly Thr Leu He Cys Trp Glu Asn Tyr Met Pro Phe Ala 

165 170 175 

Arg Ala Ala Leu Tyr Ala Gin Gly Val Glu Val Leu Val Ala Pro Thr 

180 185 190 

Tyr Asp Glu Gly Pro Val Trp Leu Ala Ser Met Gin His He Ala Arg 

195 200 205 

Glu Gly Gly Cys Trp Val Val Gly Asn Gly Cys Ala Phe Gin Gly Arg 

210 215 220 

Asp Met Pro Asp Thr Leu Pro Gly Lys Ala Gin Leu Phe Pro Glu Ala 
225 230 235 240 

Asp Ala Trp Val Asn Ala Gly Asp Ser Vai He Val Ala Pro Gly Gly 

245 250 255 

Arg Thr Val Ala Gly Pro Leu His Glu Ala Phe Gly Leu Phe Thr Ala 
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260 265 270 

Glu He Asp Leu Ser Arg Val Gly Met Ala Arg Arg Ser Leu Asp Val 

275 280 285 

Ala -Gly His Tyr Gly Arg Pro Asp He Phe Cys Leu Gin Val Asn Ala 

290 295 300 

Arg Ala Gin Pro Pro Val Glu Val Thr His His Gly 
305 310 315 

<210> 325 
<211> 1077 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 325 

atgactgcaa aaaagattgt ccgcgccgcc gctgtccagc tcaatcccgt gctggacagc 60 

gccgacggca cccttgtgaa agtgttgcag gcgattgccg acgccgcagc gcagggcgtg 120 

caactgatcg tctttcccga gacggtggtg ccctactatc cttacttttc cttcgtcacg 180 

cctgcggttt cgatgggcgc ggcgcatctg aaactgtacg aacaatcgcc cacggtgcca 240 

ggtccactga ccgacgccgt cgccgcagcc gcgcgggcac atcagatggt ggttgtgctc 300 

ggcgtcaacg agcgcgatca cggcacgctc tacaacacgc aactgatctt cgacgccgac 360 

ggcacgctcc cactgaagcg tcgcaagatc acgccgacct atcacgagcg catggtctgg 420 

ggcatgggcg atggctccgg cctgcgcacg gtgaagaccg aggtcggaac cgttggcgcg 480 

ctggcctgct gggaacacta caacccgctg gcacgctacg cgctgatggc gcagcacgaa 540 

gagatccatt gcagccagtt ccccggctcg ctggtcggcc cgatcttttc cgagcagatg 600 

gaaatcacca tgcgtcatca cgcgctggaa tccggctgct tcgtcgtcaa cgccactgca 660 

tggctgacgc ctgagcaggt gcgatcacag gcgccaacac cggcaatgga aaaagccttc 720 

tccggtggtt gctacaccgc gatcatttcg ccggaaggaa aacatctggg cgaacctctt 780 

cgcgacggcg aaggcatggt catcgccgat cttgattttg atctcatcac caagcgcaag 840 

cgaatgatgg attcggttgg ccactacgca cggccggaat tgttgagcct gcagctcgac 900 

aaccgatcaa ctgcaccgct gacaacgtcg ccggtggccg ccgcagcgcc gtcgcttgca 960 

gagatggaag cacagcgcct gtcacgttat ctcgatgcca gctccggcag cgccgcacaa 1020 

ggcatcgaag ccgcctacat caatgccctc agctcttttt caggaaaacc ctcatga 1077 

<210> 326 

<211> 358 

<212> PRT 

<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 326 



Met 


Thr 


Ala 


Lys 


Lys He 


Val 


Arg 


Ala 


Ala 


Ala Val 


Gin 


Leu 


Asn 


Pro 


1 






5 








10 








15 




Val 


Leu 


Asp 


Ser 


Ala Asp 


Gly 


Thr 


Leu 


Val 


Lys Val 


Leu 


Gin 


Ala 


He 






20 








25 








30 






Ala 


Asp 


Ala 


Ala 


Ala Gin 


Gly 


Val 


Gin 


Leu 


He Val 


Phe 


Pro 


Glu 


Thr 




35 








40 








45 








Val 


Val 


Pro 


Tyr 


Tyr Pro 


Tyr 


Phe 


Ser 


Phe 


Val Thr 


Pro 


Ala 


Val 


Ser 




50 






55 








60 










Met 


Gly 


Ala 


Ala 


His Leu 


Lys 


Leu 


Tyr 


Glu 


Gin Ser 


Pro 


Thr 


Val 


Pro 


65 






70 










75 








80 


Gly 


Pro 


Leu 


Thr 


Asp Ala 


Val 


Ala 


Ala 


Ala 


Ala Arg 


Ala 


His 


Gin 


Met 








85 








90 








95 




Val 


Val 


Val 


Leu 
100 


Gly Val 


Asn 


Glu 


Arg 
105 


Asp 


His Gly 


Thr 


Leu 
110 


Tyr 


Asn 


Thr 


Gin 


Leu 


He 


Phe Asp 


Ala 


Asp 


Gly 


Thr 


Leu Pro 


Leu 


Lys 


Arg 


Arg 



115 120 125 
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Lys He 


Thr 


Pro 


Thr 


Tyr 


His 


Glu Arg Met Val Trp Gly Met Gly 


Asp 


130 










135 


140 




Gly Ser 


Gly 


Leu 


Arg 


Thr 


Val 


Lys Thr Glu Val Gly Thr Val Gly 


Ala 


145 








150 




155 * 


160 


Leu Ala 


Cys 


Trp 


Glu 
165 


His 


Tyr 


Asn Pro Leu Ala Arg Tyr Ala Leu 
170 175 


Met 


Ala Gin 


His 


Glu 
180 


Glu 


He 


His 


Cys Ser Gin Phe Pro Gly Ser Leu 
185 190 


Val 


Gly Pro 


He 
195 


Phe 


Ser 


Glu 


Gin 


Met Glu He Thr Met Arg His His 
200 205 


Ala 


Leu Glu 


Ser 


Gly 


Cys 


Phe 


Val 


Val Asn Ala Thr Ala Trp Leu Thr 


Pro 


210 










215 


220 




Glu Gin 


Val 


Arg 


Ser 


Gin 


Ala 


Pro Thr Pro Ala Met Glu Lys Ala 


Phe 


225 








230 




235 


240 


Ser Gly 


Gly 


Cys 


Tyr 
245 


Thr 


Ala 


He He Ser Pro Glu Gly Lys His 
250 255 


Leu 


Gly Glu 


Pro 


Leu 


Arg Asp Gly 


Glu Gly Met Val He Ala Asp Leu 


Asp 






260 








265 270 




Phe Asp 


Leu 
275 


He 


Thr 


Lys 


Arg 


Lys Arg Met Met Asp Ser Val Gly 
280 285 


His 


Tyr Ala 


Arg 


Pro 


Glu 


Leu 


Leu 


Ser Leu Gin Leu Asp Asn Arg Ser 


Thr 


290 










295 


300 




Ala Pro 


Leu 


Thr 


Thr 


Ser 


Pro 


Val Ala Ala Ala Ala Pro Ser Leu 


Ala 


305 








310 




315 


320 


Glu Met 


Glu 


Ala 


Gin 
325 


Arg 


Leu 


Ser Arg Tyr Leu Asp Ala Ser Ser 
330 335 


Gly 


Ser Ala 


Ala 


Gin 


Gly He Glu 


Ala Ala Tyr He Asn Ala Leu Ser 


Ser 






340 








345 350 




Phe Ser 


Gly 
355 


Lys 


Pro 


Ser 









<210> 327 
<211> 975 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 327 

atggttgatc aaatacttaa cgatcgttct gaattattaa cagttggttt ggcgcagatt 60 

gctccaatct ggctgaaccg cgaaaaaaca cttgcgaaag tggttgagaa agtaaatcaa 120 

gctgcaaagc aagattgtca tcttgttgct tttggtgaag ctttggtccc cggatatccc 180 

ttttggattg aacttacaga tggcgcgcga ttcaactcca atgttcaaaa agaaattcac 240 

gcgcactata tggatcaggc agtacagata gagaatggtc atttaaaagc gctttgtgaa 300 

acatcggccg caaacaagat tgccgtgatc gttgggtgca ttgaacgcgc agccgatcgc 360 

ggcgggcaca gcttatatgc ttcattggtt tttattaatc ctcaagggca aatcggatcg 420 

gtgcatcgca aacttatgcc aacttatgaa gaacgtttaa cctggtcacc tggcgatgga 480 

catggtttgc gaacgcatca actaggtgct ttcactgttg gcggcttgaa ttgttgggaa 540 

aactggatgc cattgccacg cgccgcattg tacgcacaag gtgaagactt tcatgtcgca 600 

atctggccgg gaagcattca caatacgcaa gatattacgc gctatattgc gaaggaatcc 660 

agatcgtttg taatgtctac ttccgggttc atgcgtaaag aagactttcc atctgatacg 720 

cctcacttag acaaaatact tgcaaactct ccgaatgtac ttgcaaatgg tggatcctgt 780 

ctcgccggac cggacggtca gtggatcgtt gaaccatttg tgaatgagga aaaattagtt 840 

gttgcgactg tgaaccacaa gcaggtccgt gaagaaagac aaaattttga tccggtcgga 900 

cactattccc gtcctgatgt cacgcagttg attgtgaaca gacaaagaca atcaacaatc 960 

aagctcaacg attaa 9*75 

<210> 328 
<211> 324 
<212> PRT 
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<213> Unknown 
<220> 

<223> Obtained from an environmental sample 
<400> 328 

Met Val Asp Gin He Leu Asn Asp Arg Ser Glu Leu Leu Thr Val Gly 

15 10 15 

Leu Ala Gin He Ala Pro He Trp Leu Asn Arg Glu Lys Thr Leu Ala 

20 25 30 

Lys Val Val Glu Lys Val Asn Gin Ala Ala Lys Gin Asp Cys His Leu 

35 40 45 

Val Ala Phe Gly Glu Ala Leu Val Pro Gly Tyr Pro Phe Trp He Glu 

50 55 60 

Leu Thr Asp Gly Ala Arg Phe Asn Ser Asn Val Gin Lys Glu He His 
65 70 75 80 

Ala His Tyr Met Asp Gin Ala Val Gin He Glu Asn Gly His Leu Lys 

85 90 95 

Ala Leu Cys Glu Thr Ser Ala Ala Asn Lys He Ala Val He Val Gly 

100 105 HO 

Cys He Glu Arg Ala Ala Asp Arg Gly Gly His Ser Leu Tyr Ala Ser 

115 120 125 

Leu Val Phe He Asn Pro Gin Gly Gin He Gly Ser Val His Arg Lys 

130 135 140 

Leu Met Pro Thr Tyr Glu Glu Arg Leu Thr Trp Ser Pro Gly Asp Gly 
145 150 155 160 

His Gly Leu Arg Thr His Gin Leu Gly Ala Phe Thr Val Gly Gly Leu 

165 170 175 

Asn Cys Trp Glu Asn Trp Met Pro Leu Pro Arg Ala Ala Leu Tyr Ala 

180 185 190 

Gin Gly Glu Asp Phe His Val Ala He Trp Pro Gly Ser He His Asn 

195 200 205 

Thr Gin Asp He Thr Arg Tyr He Ala Lys Glu Ser Arg Ser Phe Val 

210 215 220 

Met Ser Thr Ser Gly Phe Met Arg Lys Glu Asp Phe Pro Ser Asp Thr 
225 230 235 240 

Pro His Leu Asp Lys He Leu Ala Asn Ser Pro Asn Val Leu Ala Asn 

245 250 255 

Gly Gly Ser Cys Leu Ala Gly Pro Asp Gly Gin Trp He Val Glu Pro 

260 265 270 

Phe Val Asn Glu Glu Lys Leu Val Val Ala Thr Val Asn His Lys Gin 

275 280 285 

Val Arg Glu Glu Arg Gin Asn Phe Asp Pro Val Gly His Tyr Ser Arg 

290 295 300 

Pro Asp Val Thr Gin Leu He Val Asn Arg Gin Arg Gin Ser Thr He 
305 310 315 320 

Lys Leu Asn Asp 

<210> 329 
<211> 1023 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 329 

gtgccggcca aagtacgcgc cgccgccgtg cagctcagcc ccgtgctctt cagccgtgaa 60 
ggcacgacca gcaaggtctg cgacaagatt gccgaggcgg cggcgcaggg cgccgagctg 120 
gtggtgtttc ccgaaaccgt agtgccgtat tacccgtatt tctcgttcat caaggctccg 180 
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600 
660 
720 



840 
900 
960 
1020 
1023 



gccgtgatcg gcgccgagca cttactcttg ctcgaacaag ccgtcacggt gcccgggccc 240 
agcgtcgaag ccatcgcaga agccgctcgc aaggcgggcg cggtggtttc gatcggcgtc 300 
aacgaacgcg atcacggcac gctgtacaac acccagctgt tgttcgacgc cgacggtcgg 360 
ttggcgcaag cccgccgcaa gatcaccccc acgtatcacg agcggatgat ctgggggcag 4^0 
ggcgatggct cgggcttggt ggcggtggat acgcgagtcg gcaggattgg ctccctggcg 480 
tgctgggagc actacaaccc gctggctcgc tatgcgctga tggccgacca cgagcaaatt 540 
cacgtggcca tgttccctgg ctcgctcgtg ggcgacatct tccgcgagca aatcgaggtc 600 
acgattcggc accacgcgct cgagtcgggc tgcttcgtcg tcaacgcgac gggctacctg 
agcgacgcgc aggtgacgca gatcgcgggc gacaccaagc tcgaccgcgc cctgcgcggc 
ggttgcttca ccgccatcgt atcgcccgag ggcacgctgc tggcgccacc gctcaccgac 780 
ggtgagggca tggtcattgc cgacctcgat ctgtcgctca tcgccaaacg caaacgcatg 840 
atggacagcg tcggccatta cagccggccc gagctgctca gcgtgctgat cgaccgctcg 
ccgcagccgc atttgcgcga gaaaactgcc tctttacccg aacctcggat gtcccatgaa 
tcgctcgctc ccgatagtaa cggcctccgc gacgctgacg cgaaagcctc gaccctctcc 
tga 

<210> 330 
<211> 340 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 330 

Val Pro Ala Lys Val Arg Ala Ala Ala Val Gin Leu Ser Pro Val Leu 

15 10 15 

Phe Ser Arg Glu Gly Thr Thr Ser Lys Val Cys Asp Lys He Ala Glu 

20 25 30 

Ala Ala Ala Gin Gly Ala Glu Leu Val Val Phe Pro Glu Thr Val Val 

35 40 45 

Pro Tyr Tyr Pro Tyr Phe Ser Phe He Lys Ala Pro Ala Val He Gly 

50 55 60 

Ala Glu His Leu Leu Leu Leu Glu Gin Ala Val Thr Val Pro Gly Pro 
65 70 75 80 

Ser Val Glu Ala He Ala Glu Ala Ala Arg Lys Ala Gly Ala Val Val 

85 90 95 

Ser He Gly Val Asn Glu Arg Asp His Gly Thr Leu Tyr Asn Thr Gin 

100 105 HO 

Leu Leu Phe Asp Ala Asp Gly Arg Leu Ala Gin Ala Arg Arg Lys He 

115 120 125 

Thr Pro Thr Tyr His Glu Arg Met He Trp Gly Gin Gly Asp Gly Ser 

130 135 140 

Glv Leu Val Ala Val Asp Thr Arg Val Gly Arg He Gly Ser Leu Ala 
14 5 150 155 160 

Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Leu Met Ala Asp 

165 170 175 

His Glu Gin He His Val Ala Met Phe Pro Gly Ser Leu Val Gly Asp 

180 185 190 

He Phe Arg Glu Gin He Glu Val Thr He Arg His His Ala Leu Glu 

195 200 205 

Ser Gly Cys Phe Val Val Asn Ala Thr Gly Tyr Leu Ser Asp Ala Gin 

210 215 220 

Val Thr Gin He Ala Gly Asp Thr Lys Leu Asp Arg Ala Leu Arg Gly 
225 230 235 240 

Glv Cvs Phe Thr Ala He Val Ser Pro Glu Gly Thr Leu Leu Ala Pro 

* 245 250 255 

Pro Leu Thr Asp Gly Glu Gly Met Val He Ala Asp Leu Asp Leu Ser 

260 265 . 270 

Leu He Ala Lys Arg Lys Arg Met Met Asp Ser Val Gly His Tyr Ser 
275 280 285 
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Arg Pro Glu Leu Leu Ser Val Leu He Asp Arg Ser Pro Gin Pro His 

290 295 300 

Leu Arg Glu Lys Thr Ala Ser Leu Pro Glu Pro Arg Met Ser His Glu 
305 310 315 320 

Ser Leu Ala Pro Asp Ser Asn Gly Leu Arg Asp Ala Asp Ala Lys Ala 
325 330 335 

Ser Thr Leu Ser 
340 

<210> 331 
<211> 1041 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 331 

atgtccggaa caggacctat cggaccggtc gtgaaggtcg ccgtcgtaca agccgcgcca 60 

tgtctgctcg atcttgatgc gggcgtcgag aaggcgatcg ccttgatcga ccaagcgggc 120 

aaggctggcg cgcggctcat caattttccc gaaatttggc tgccgggtta tccttggtgg 180 

atctggctga atccacccgc catcaacatg cagtatgtcg cgccttacat gaacaactcc 240 

attgtcgcag gcagcaagca tgaccacgca cttcgggccg ccgcgcgccg caacaatatt 300 

cacgtcgtga tcggtgtctc "cgagcgcgcc ggcggcagtc tgtacatggc tcagtggcac 360 

tatgggcccg agggcgaggt gatctcgcgt cgtcgtaagc taaaacccac ccatgtcgaa 420 

cgcagcgtct ttggcgaagg cgacggcagc gacatgatcg ttactcagac cgatttcggg 480 

cgcgtcggcg cgctatgctg ttgggaacac ctgcaaccgc tgtcgaaata cgcgctcttt 540 

tcccaggacg agcagattca ttgcgcggcg tggcccgctt tcagcctcta tgcaaaactc 600 

tcgaaggcct tcagccccga agtcagcgtc aacgtgaacc aaatctacgc cgtagaaggg 660 

caatgcttcg tcctgtcgtc gtgctcggtt atcgatcagg cgatctacga cacgttggtg 720 

cagaacgaat tgcaccagaa gttcctcgag gtgggcgggg gctacagccg aatcttcggg 780 

ccgaacggtg cggaattcgg tgagaatctc ccacccgata gggaaggcct ggtggttgcc 840 

gatatcgatc tcggcctgat ctcgcactcc aagagtgccg ctgatccggc tggtcactac 900 

gcgagggccg acgctctggc gctcatgcat aaccgtaatc cccgccgtcc ggttatcggt 960 

ttcggcgagg cgacgcgcaa ggttgccgac gcactgccta aaggcgcgga acccgcggaa 1020 

gcgctcgaag cggccgagtg a 1041 

<210> 332 
<211> 346 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 332 



Met 


Ser 


Gly 


Thr 


Gly 


Pro 


He 


Gly 


Pro 


Val 


Val Lys Val Ala Val 


Val 


1 








5 










10 


15 




Gin 


Ala 


Ala 


Pro 


Cys 


Leu 


Leu 


Asp 


Leu 


Asp 


Ala Gly Val Glu Lys 


Ala 








20 










25 




30 




He 


Ala 


Leu 


He 


Asp 


Gin 


Ala 


Gly 


Lys 


Ala 


Gly Ala Arg Leu He 


Asn 






35 










40 






45 




Phe 


Pro 


Glu 


He 


Trp 


Leu 


Pro 


Gly 


Tyr 


Pro 


Trp Trp He Trp Leu Asn 




50 










55 








60 




Pro 


Pro 


Ala 


He 


Asn 


Met 


Gin 


Tyr 


Val 


Ala 


Pro Tyr Met Asn Asn 


Ser 


65 










70 








75 


80 


He 


Val 


Ala 


Gly 


Ser 


Lys 


His 


Asp 


His 


Ala 


Leu Arg Ala Ala Ala 


Arg 










85 










90 


95 




Arg 


Asn 


Asn 


He 


His 


Val 


Val 


He 


Gly 


Val 


Ser Glu Arg Ala Gly Gly 






100 










105 




110 




Ser 


Leu 


Tyr 


Met 


Ala 


Gin 


Trp 


His 


Tyr 


Gly 


Pro Glu Gly Glu Val 


He 
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<213> Unknown 
<220> 

<223> Obtained from an environmental sample 
<400> 334 

Met Ala lie Glu His Pro Arg Tyr Arg Val Ala Ala Val Gin Ala Ala 

15 10 15 

Pro Glu Phe Leu Asn Leu Glu Ala Thr Val Asp Lys Thr He Ala Leu 

20 25 30 

He Glu Glu Ala Ala Arg Gly Gly Ala Ser Leu He Ala Phe Pro Glu 

35 40 45 

Thr Trp He Pro Gly Tyr Pro Trp Phe Ala Trp Leu Gly Ala Pro He 

50 55 60 

Trp Gly Met Lys Phe He Gin Ala Tyr His Asp Asn Ser Met Val He 
65 70 75 80 

Asp Gly Ala Gin Phe Glu Arg He Ala Gin Ala Ala Ser Arg Cys Asn 

85 90 95 

He Thr Val Val Leu Gly Phe Ser Glu Lys Asp Ala Gly Ser Leu Tyr 

100 105 HO 

He Ala Gin Ala He Leu Ser Pro Glu Gly Lys Thr He Ala Thr Arg 

115 120 125 

Arg Lys Leu Lys Pro Thr His Val Glu Arg Ala He Phe Gly Glu Gly 

130 135 140 

Asp Gly Ser Asp Leu Ala Val His Asp Thr Lys Leu Gly Arg Val Gly 
145 150 155 160 

Ala Leu Cys Cys Trp Glu His Leu Gin Pro Leu Ser Lys Tyr Ala Met 

165 170 175 

Tyr Ala Gin Asn Glu Gin Val His He Ala Ala Trp Pro Ser Phe Ser 

180 185 190 

Leu Tyr Val Asp Ala Ala Tyr Ala Leu Gly Pro Glu Val Asn Asn Ala 

195 200 205 

Ala Ser Arg Leu Tyr Ala Val Glu Gly Gin Cys Phe Val Val Ala Pro 

210 215 220 

Cys Ala Thr Val Ser Gin Lys Met He Asp Met Leu Cys Glu Thr Pro 
225 230 235 240 

Glu Gin Gin Ala Leu Leu Lys Pro Gly Gly Gly His Ala Gin He Tyr 

245 250 255 

Gly Pro Asp Gly Arg Ser Leu Ala Asp Pro Leu Pro Pro Asp Ala Glu 

260 265 270 

Gly Leu Leu Tyr Ala Asp He Asp Leu Ala Ala He Thr Leu Ala Lys 

275 280 285 

Ala Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp Val Thr Gin 

290 295 300 

Leu Leu Leu Asp Arg Asn Pro Lys Pro Arg Val Val His Ala Lys Pro 
305 310 315 320 

Gly Gin Ser Ala Asn Asn Ser Ser Pro Gly Met Arg Ala Val Glu His 

325 330 335 

Thr Glu Leu Glu Glu Gly Glu Gin Ala 
340 345 

<210> 335 
<211> 1053 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 335 

atgactacgc gcgtgattaa agtcgctgcc gcgcagctgt ctccggtgct ggcaactgca 60 
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tcggagcaca gccgcgagga cacgattgcg aaagtgatcg acgccatcgc tgcggcgtcg 120 

caacagggcg cgcaactgat tgtgtttccg gaaacagtcg ttccgtatta cccgtatttt 180 

tcatttatca cgcccgcggt aacgatgggt gccgaacatt tgaaattgta cgaacaggca 240 

gtgacggtac ccagcgcagc gacagatgct gtcgctgcgg cggcaaaaaa ttatggcatg 300 

gtagtggtgc tcggaattaa tgaacgcgat cacggctcgc tgtacaacgc gcaattaatt 360 

ttcgacgccg atggtgagct gctattaaag cgtcgaaaaa ttacgccgac ttatcacgaa 420 

cgcatggtgt ggggacaagg cgacggcagc ggtttaaaag ttgtcgatac tgctgccggc 4 80 

cgtgtcggcg cgctcgcgtg ctgggaacat tacaacccgc tcgcgcgtta cagcctgatg 540 

gcacaacacg aagaaattca ttgcagtcaa tttccgggat cgttggtcgg ccctattttc 600 

gccgaacaaa tggaaatcac catgcgtcat cacgcactcg aatccggctg ttttgtggtg 660 

aatgcaacgg cctggcttag cgatacacaa atccaatcaa ttacccccga taaagccatg 720 

cagaaagcac tgcgcggcgg ttgctacacc gcaatcatct cgcccgaagg caaacatctg 780 
tgcccaccgc tgtatgacgg agaaggaata attgtggcgg aattggactt cgcgttaatc 
accaaacgta aacgcatgat ggattccgtc ggccattacg cgcgaccaga actactttct 
ttgctcctcg atgatcgcgt aactgcgccg ctcaaaaact tacagacgac gatggcctct 

gccaaatccg ctgaagatgg ttttccttta tttgcagacg ttttatatcc agacagttct 1020 

ttcattgaga cgtcgaaatt cgcggagtca tga 1053 

<210> 336 
<211> 350 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 336 

Met Thr Thr Arg Val He Lys Val Ala Ala Ala Gin Leu Ser Pro Val 

15 10 15 

Leu Ala Thr Ala Ser Glu His Ser Arg Glu Asp Thr He Ala Lys Val 

20 25 30 

He Asp Ala He Ala Ala Ala Ser Gin Gin Gly Ala Gin Leu He Val 

35 40 45 

Phe Pro Glu Thr Val Val Pro Tyr Tyr Pro Tyr Phe Ser Phe He Thr 

50 55 60 

Pro Ala Val Thr Met Gly Ala Glu His Leu Lys Leu Tyr Glu Gin Ala 
65 70 75 80 

Val Thr Val Pro Ser Ala Ala Thr Asp Ala Val Ala Ala Ala Ala Lys 

85 90 95 

Asn Tyr Gly Met Val Val Val Leu Gly He Asn Glu Arg Asp His Gly 

100 105 HO 

Ser Leu Tyr Asn Ala Gin Leu He Phe Asp Ala Asp Gly Glu Leu Leu 

115 120 125 

Leu Lys Arg Arg Lys He Thr Pro Thr Tyr His Glu Arg Met Val Trp 

130 135 140 

Gly Gin Gly Asp Gly Ser Gly Leu Lys Val Val Asp Thr Ala Ala Gly 
145 150 155 160 

Arg Val Gly Ala Leu Ala Cys Trp Glu His Tyr Asn Pro Leu Ala Arg 

165 170 175 

Tyr Ser Leu Met Ala Gin His Glu Glu He His Cys Ser Gin Phe Pro 

180 185 190 

Gly Ser Leu Val Gly Pro He Phe Ala Glu Gin Met Glu He Thr Met 

195 200 205 

Arg His His Ala Leu Glu Ser Gly Cys Phe Val Val Asn Ala Thr Ala 

210 215 220 

Trp Leu Ser Asp Thr Gin He Gin Ser He Thr Pro Asp Lys Ala Met 
225 230 235 240 

Gin Lys Ala Leu Arg Gly Gly Cys Tyr Thr Ala He He Ser Pro Glu 

245 250 255 

Gly Lys His Leu Cys Pro Pro Leu Tyr Asp Gly Glu Gly He He Val 
260 265 270 
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Ala Glu Leu 


Asp 


Phe Ala Leu 


He 


275 






280 


Ser Val Gly 


His 


Tyr Ala Arg 


Pro 


290 




295 




Asp Arg Val 


Thr 


Ala Pro Leu 


Lys 


305 




310 




Ala Lys Ser 


Ala 


Glu Asp Gly Phe 






325 




Pro Asp Ser 


Ser 
340 


Phe He Glu 


Thr 



Thr 


Lys 


Arg 


Lys 


Arg 
285 


Met 


Met 


Asp 


Glu 


Leu 


Leu 


Ser 
300 


Leu 


Leu 


Leu 


Asp 


Asn 


Leu 


Gin 
315 


Thr 


Thr 


Met 


Ala 


Ser 
320 


Pro 


Leu 
330 


Phe 


Ala 


Asp 


Val 


Leu 
335 


Tyr 


Ser 


Lys 


Phe 


Ala 


Glu 


Ser 







345 350 



<210> 337 
<211> 957 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 337 

atgaagggag tgtgtgccat gtcgacgaaa gttgccatcg tccaggcgcc gccggtcctg 60 

ctgcatcgcg acaggacgat tgcgaaggtg cgttcgtcga tcgaggatgc cgccaatgcg 120 

ggcgcctcgc tgatcgtatt tcccgaagct tatgtacccg gctatccgag ttggatctgg 180 

cgtctcaggc ccggaggcga catgggactg tcgtctgaga ttcacgcaag attgcgggaa 240 

aatgccgttg atctcgcgaa cggaggcctg gcgcatgtcc agggggctgc agcaaaattc 300 

ggcgcgactg tcgttatcgg catcaatgaa ctcgacagcg agttcagcgg aacgacattg 360 

ttcaacaccg tggtggtcat cggccccgac ggaacgcgcc tcaacaggca tcgaaaatta 420 

atgccgacca acccggagcg catggtgtgg ggcacgggcg atgcctcggg tctgcgtgtc 480 

atcgatacgc cggcgggacg gctgggaacc atgatctgct gggagagcta catgccgctg 540 

gcgcgctatg ctctctatgc gcaaggcatc gagatatacg tcgctcccac gtgggacgca 600 

ggcgagagct ggattgctac gatgcgccac atcgccaagg aggccggctg ctgggtgatc 660 

ggcacggcaa ccgtcatcca gggcagcgat gttccggacg attttcccga acgcgacaag 720 

ctcttcaagc cggaggagtg gatcaacgac ggcgatgcgg tcgtggtcaa gcccatgggc 780 

gcgattgctg ccggaccgca caatcgacag aaaagcatac tctacgccga catcgaccgg 840 

gaggccgcgc ggcgagcccg ccggtcgctc gatgtctgtg gccactattc ccgcccagac 900 

gttttctctt tctcggtcaa ccgaaagcca ttccgccctg ccgactttgt gggttga 957 



<210> 338 
<211> 313 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 338 
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115 120 125 

Thr Arg Leu Asn Arg His Arg Lys Leu Met Pro Thr Asn Pro Glu His 

130 135 140 

Gly Val Gly His Gly Arg Cys Leu Gly Ser Ala Cys His Arg Tyr Ala 
145 150 155 160 

Gly Gly Thr Ala Gly Asn His lie Cys Trp Glu Ser Tyr Met Pro Leu 

165 170 175 

Ala Arg Tyr Ala Leu Tyr Ala Gin Gly lie Glu lie Tyr Val Ala Pro 

180 185 190 

Thr Trp Asp Ala Gly Glu Ser Trp lie Ala Thr Met Arg His lie Ala 

195 200 205 

Lys Glu Ala Gly Cys Trp Val lie Gly Thr Ala Thr Val lie Gin Gly 

210 215 220 

Ser Asp Val Pro Asp Asp Phe Pro Glu Arg Asp Lys Leu Phe Lys Pro 
225 230 235 240 

Arg Ser Gly Ser Thr Thr Ala Met Arg Ser Trp Ser Ser Pro Trp Ala 

245 250 255 

Arg Leu Leu Pro Asp Arg Thr lie Asp Arg Lys Ala Tyr Ser Thr Pro 

260 265 270 

Thr Ser Thr Gly Arg Pro Arg Gly Glu Pro Ala Gly Arg Ser Met Cys 

275 280 285 

Gly His Tyr Ser Arg Pro Asp Val Phe Ser Phe Ser Val Asn Arg Lys 

290 295 300 

Pro Phe Arg Pro Ala Asp Phe Val Gly 
305 310 

<210> 339 

<211> 1020 

<212> DNA 

<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 339 

atgtcaggat cattcaaagt tgcagcagtg caggcggctc cggccttcct caatctcgat 60 

gcgggcatcg acaaggcggt ggcgctgatc gagcaagccg ccgcgcagga cgttcagctc 120 

atcgcctttc ccgagacctg gttgcccgga tacccctggt ggatctggct cgatgcaccc 180 

gccgttacca tgggctatgt cgttccctac aatctcaatt cactagaggc gggcagcccg 240 

caggacaagc gtctggcaaa tgccgcgcgt gagaacaaca tccaggtggt gatgggtctg 300 

tctgaacgcc atgacggcac gctctacatc gcgcagtggc actatggtga agatggcgag 360 

gtgatctcgc gacggcgcaa gctcaagccg acccatgtcg aacgcacggt gttcggggaa 420 

ggcgacggca gcgacatggt ggtcaaggac acaagtctgg gacgggtcgg cgctctgtgc 4 80 

tgttgggaac acctgcagcc gctcaacaaa tacgcgatgt actcccagaa cgagcagatc 540 

cacatcggtt cctggcccag cttcagcctc tacaagggcg gcgcctatgc gcttggggca 600 

gacctcaaca cggcggcaag ccagatgtat gcggccgagg gccagtgctt tgttctggct 660 

gcctgcgcca cggtcagtca ggacatgttc gacatgctct gcgacacgga aatgaagcag 720 

cagttcctga ccaccggtgg cggattcgcc cgcattttcg gacctgacgg ctcgcccatg 780 

ggcaatgtgc ttgaagaaca tgaagaaggg ctggtgatcg ccgaaatcga tctcaccatg 840 

attgcgatcg ccaaggcggc ggctgacccg tgcgggcact attcccggcc cgatgtgttc 900 

cgactgatgt tcaaccagaa gccaagcccg gtggtgatgc cattcgaaaa cgacgtggcc 960 

cgcgagattg tcgaggccgc cgaggatcag gtcagcgcaa accggctcgc agcggaatga 1020 

<210> 340 
<211> 329 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
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<400> 340 

Met Ser Gly Ser Phe Lys Val Ala Ala Val Gin Ala Ala Pro Ala Phe 

15 10 15 

Leu Asn Leu Asp Ala Gly lie Asp Gly Gly Gly Ala Asp Arg Ala Ser 

20 25 30 

Arg Arg Ala Gly Arg Ser Ala His Arg Leu Ser Arg Asp Leu Val Ala 

35 40 45 

Gly Tyr Pro Trp Trp lie Trp Leu Asp Ala Pro Ala Val Thr Met Gly 

50 55 60 

Tyr Val Val Pro Tyr Asn Leu Asn Ser Arg Gly Gly Gin Pro Ala Gly 
65 70 75 80 

Gin Ala Ser Gly Lys Cys Arg Ala Glu Gin His Pro Gly Gly Asp Gly 

85 90 95 

Leu Ser Glu Arg His Asp Gly Thr Leu Tyr lie Ala Gin Trp His Tyr 

100 105 110 

Gly Glu Asp Gly Glu Val He Ala Thr Ala Gin Ala Gin Ala Asp Pro 

115 120 125 

Cys Arg Thr His Gly Val Arg Gly Arg Arg Arg Gin Arg His Val Val 

130 135 140 

Lys Asp Thr Ser Leu Gly Arg Val Gly Ala Leu Cys Cys Trp Glu His 
145 150 155 160 

Leu Gin Pro Leu Asn Lys Thr Arg Cys Thr Pro Arg Thr Ser Arg Ser 

165 170 175 

Thr Ser Val Pro Gly Pro Ala Ser Ala Ser Thr Arg Ala Ala Leu Cys 

180 185 190 

Ala Trp Gly Arg Pro Gin His Gly Gly Lys Pro Asp Val Cys Gly Arg 

195 200 205 

Gly Pro Val Leu Cys Ser Leu Pro Ala Pro Arg Ser Val Arg Thr Cys 

210 215 220 

Ser Thr Cys Ser Ala Thr Arg Lys Ser Ser Ser Ser Thr Gly Gly Gly 
225 230 235 240 

Phe Ala Arg He Phe Gly Pro Asp Gly Ser Pro Met Gly Asn Val Leu 

245 250 255 

Glu Glu His Glu Arg Ala Gly Asp Arg Arg Asn Arg Ser His His Asp 

260 265 270 

Cys Asp Arg Gin Gly Gly Gly Pro Val Arg Ala Leu Phe Pro Ala Arg 

275 280 285 

Cys Val Pro Thr Asp Val Gin Pro Glu Ala Lys Pro Gly Gly Asp Ala 

290 295 300 

He Arg Lys Thr Trp Pro Ala Arg Leu Ser Arg Pro Pro Arg He Arg 
305 310 315 320 

Ser Ala Gin Thr Gly Ser Gin Arg Asn 
325 

<210> 341 
<211> 1056 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 341 

atggcactga ccaacccaaa atataaagtc gccgccgtcc aggccgcgcc agcgttcctc 60 

gatctggatg cgtccgtcga aaaagcggtc cggctgatcg acgaggccgg cgccaagggc 120 

gcgcgcctca ttgcgttccc ggaaacctgg atccccggct atccctggtg gatctggctc 180 

ggcgcgccgg cctgggcgat catgaaaggt tttgtctcgg cctatttcga caattcgctc 240 

acctatgaca gtccggccgc ggacaaattg cgccaggccg ccaagcgcaa cgatatcgtc 300 

gtggtgctcg gcctgtcgga gcgcgacggc ggcagtctct atatcgcgca atggatcatc 360 

ggcccggacg gcgaaactgt cgcccagcgc cgcaagctca agccgaccca tgtcgagcgt 420 

tcggtgttcg gcgagggcga tggcagcgac cttgccgtgc atgagctcgc gatcgggcgc 480 
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gtcggcgcgc tgtgctgctg ggagcatctg caaccgctgt cgaaatacgc catgtatgcg 540 

cagaacgagc aggttcacgt ggcggcctgg ccgagctttt cgctgtacga tccgttcgcc 600 

catgccctcg gtgcggaggt caacaacgcc gccagcaaga tctacgcggt cgagggatcg 660 

tgcttcgtcg tggcaccctg cgccaccgtt tcaaaggaaa tgatcgacct gttgtgcgac 720 

acgccggaca agcacggtct gctgcacgcc ggcggcgggt ttgccgcgat ctacggccct 780 
gacggctcgc cgatcggcga ccgcctggcg cccgaccagg aaggtctgat ctatgccgat 
gtcgatctcg gcatgatctc ggtcgcgaag gccgccgccg atccggccgg acattatgcg 
cggccggatg tcacaaggtt gctgctcaac aaacgcccgg gcaatcgtgt cgaggcgctg 

gcacttccgg tggatcaggt tgcggcaggt gaggagatcc cctcgatatc gcgatcggcc 1020 

agaggggttg ccgaactgcc aaacgcggcc gaatag 1056 

<210> 342 
<211> 342 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 342 

Met Ala Leu Thr Asn Pro Lys Tyr Lys Val Ala Ala Val Gin Ala Ala 

15 10 15 

Pro Ala Phe Leu Asp Leu Asp Ala Pro Ser Lys Lys Arg Ser Gly Ser 

20 25 30 

Thr Arg Pro Ala Pro Arg Ala Arg Ala Ser Leu Arg Ser Arg Thr Trp 

35 40 45 

lie Pro Gly Tyr Pro Trp Trp lie Trp Leu Gly Ala Pro Ala Trp Ala 

50 55 60 

lie Met Lys Gly Phe Val Ser Pro He Ser Thr He Arg Ser Pro Met 
65 70 75 80 

Thr Val Arg Pro Arg Thr Asn Cys Ala Arg Pro Pro Ser Ala Thr Tyr 

85 90 95 

Arg Arg Gly Ala Arg Pro Val Gly Ala Arg Arg Arg Gin Ser Leu Tyr 

100 105 HO 

Arg Ala Met Asp His Arg Pro Thr Ala Lys Leu Ser Pro Ser Ala Ala 

115 120 125 

Ser Ser Ser Arg Pro Met Ser Ser Val Arg Cys Ser Ala Arg Ala Met 

130 135 140 

Ala Ala Thr Leu Pro Cys Met Ser Ser Arg Ser Gly Ala Ser Ala Arg 
145 150 155 160 

Cys Ala Ala Gly Ser He Cys Thr Ala Val Glu He Arg His Val Cys 

165 170 175 

Ala Glu Arg Ala Gly Ser Arg Gly Gly Leu Ala Glu Leu Phe Ala Thr 

180 185 190 

He Arg Ser Pro Met Pro Ser Val Arg Arg Ser Thr Thr Pro Pro Ala 

195 200 205 

Arg Ser Thr Arg Ser Arg Asp Arg Ala Ser Ser Trp His Pro Ala Pro 

210 215 220 

Pro Phe Gin Arg Lys Ser Thr Cys Cys Ala Thr Arg Arg Thr His Gly 
225 230 235 240 

Leu Leu His Ala Gly Gly Gly Phe Ala Ala He Tyr Gly Pro Asp Gly 

245 250 255 

Ser Pro He Gly Asp Arg Trp Arg Pro Thr Arg Lys Val Ser Met Pro 

260 265 270 

Met Ser He Ser Ala Ser Arg Ser Arg Arg Pro Pro Asp Pro Ala Gly 

275 280 285 

His Tyr Ala Arg Pro Asp Val Thr Arg Leu Leu Leu Asn Lys Arg Pro 

290 295 300 

Glv Asn Arg Val Arg Arg Trp His Phe Arg Trp He Arg Leu Arg Gin 
305 310 315 320 

Val Arg Arg Ser Pro Arg Tyr Arg Asp Arg Pro Glu Gly Cys Arg Thr 
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325 330 335 

Ala Lys Arg Gly Arg lie 



60 



340 

<210> 343 
<211> 942 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 343 

atgaagtcaa aaattgcggt tattcagcga cctcctgtat tgttagacct tcaggcttca 

atcgcccgtg ccattacttc tgttgaggaa gcggcaggca agggatctga gctccttgtt 120 

tttcccgaga catttctgcc tggttatccg tcctggatct ggcgtctcaa gccgggcgga 180 

gacatggtgc tgacatctga aatccacgca aaatatcgcg cgaactctgt tgatgttgag 240 

cgcggggatc tggccccttt atgcgaagcg gcggcgaaac acggcgtcac aattgtcatg 300 

gggctcagtg aaattgatgg gcgctacagc gggactacac tctttaatac agtggtgacc 360 

attggcgcgg aaggagagct ccttaataga caccgcaagc tcatgccgac aaacccagag 420 

cgtatggtct gggggcaagg ggatgcctct ggtctgcggg ttgtcgacac gcccgtgggc 480 

cgcgtcggca cgctgatctg ctgggaaaac tacatgccgc tatcgcgcta tgcgctttat 540 

tctcaaaaca ttgacatcta tgtggcgccg acctgggacg cgggcgagag ctggatcgcc 600 

tccatgcagc atatcgccaa agaaggtggc tgctgggtga tcggcacggc cacggcgatg 660 

gagggctctg atgtcccagc cgacttccct cagcgggagg tgcttttccc tgatagcagc 720 

gaatggatca atgacggtga cgctgtagtg gttaaaccca tgggggcgat tgtcgcgggt 780 

ccgcatcacc gggataagag tattctctat gctgagattg acgtcgaagt ggcacgcaat 840 

gcgcggcgct cgctcgatgt ggcggggcat tactcccggc cggatatttt ttcctttggc 900 

gtggatcgcc ggcctttgcc gccggttacg tttgaggatt ga 942 

<210> 344 
<211> 303 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 344 

Met Lys Ser Lys He Ala Val lie Gin Arg Pro Pro Val Leu Leu Asp 

15 10 15 

Leu Gin Ala Ser He Ala Arg Ala He Thr Leu Leu Arg Lys Arg Gin 

20 25 30 

Ala Arg Asp Leu Ser Ser Leu Phe Phe Pro Arg His Phe Cys Leu Val 

35 40 45 

He Arg Pro Gly Ser Gly Val Ser Ser Arg Ala Qlu Thr Trp Cys His 

50 55 60 

Leu Lys Ser Thr Gin Asn He Ala Arg Thr Leu Asp Val Glu Arg Gly 
65 70 75 80 

Asp Leu Ala Pro Leu Cys Glu Ala Ala Ala Lys His Gly Val Thr He 

85 90 95 

Val Met Gly Gin Asn Trp Ala Leu Gin Arg Asp Tyr Thr Leu Tyr Ser 

100 105 HO 

Gly Asp His Trp Arg Gly Arg Arg Leu Leu Asn Arg His Arg Lys Leu 

115 120 125 . 

Met Pro Thr Asn Pro Glu Arg Met Val Trp Gly Gin Gly Asp Ala Ser 

130 135 140 

Val Cys Gly Leu Ser Thr Arg Pro Trp Ala Ala Ser Ala Arg Ser Ala 
14 5 150 155 160 

Glv Lys Thr Thr Cys Arg Tyr Arg Tyr Ala Leu Tyr Ser Gin Asn He 
165 170 175 
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Asp lie 


Tyr 


Val 


Ala Pro Thr 


Trp 


Asp 


Ala 


Gly Glu 


Ser 


Trp 


He 


Ala 






180 






185 










190 






Ser Met 


Gin 


His 


lie Ala Lys 


Glu 


Gly 


Gly 


Cys 


Trp 


Val 


He 


Gly 


Thr 




195 






200 










205 








Ala Thr 


Ala 


Met 


Glu Gly Ser 


Asp 


Ser 


Gin 


Pro 


Thr 


Ser 


Leu 


Ser 


Gly 


210 






215 










220 










Arg Cys 


Phe 


Ser 


Leu lie Ala 


Ala 


Asn 


Gly 


Ser 


Met 


Thr 


Val 


Thr 


Leu 


225 






230 








235 










240 


Trp Leu 


Asn 


Pro 


Trp Gly Arg 


Leu 


Ser 


Arg 


Val 


Arg 


He 


Thr Gly 


He 






245 






250 










255 




Arg Val 


Phe 


Ser 


Met Leu Arg 


Leu 


Thr 


Arg 


Ser 


Gly 


Thr 


Gin 


Cys 


Ala 




260 






265 










270 






Ala Leu 


Ala 


Arg 


Cys Gly Gly 


Ala 


Leu 


Leu 


Pro 


Ala 


Gly 


Tyr 


Phe 


Phe 




275 






280 










285 








Leu Trp 


Val 


Asp 


Arg Arg Pro 


Leu 


Pro 


Pro 


Val 


Thr 


Phe 


Glu 


Asp 




290 






295 










300 











<210> 345 
<211> 1011 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 345 

atgaaagcga tcaaggccgc tgccgtgcag gcagcaccgg tattcctgaa cctcgacgca 60 

tcgat caeca aggeggaaac attcgtcgcc gaggccgccg cgaatggtgc caagctggtg 120 

gcgtttccgg aaacctggct geegggctat ccctggttca tetggctegg tgcgcccgcc 180 

gaaggcatgc agttcatccc gcgctatcac gaaaacagca tggagctccg ctcgcccgag 240 

atgcgccgct tgeaggegat cgcgcgcaag tatgaagtga cgctcgtcat gggctattcc 300 

gagegegatg gtggcagccg ctacatgtcc caggtcatta ttggcgatca gggegacate 360 

cttctcaatc geegtaaatt gaagccaacc catgtcgagc ggaeggtett eggegaagge 420 

gaeggttegg acctggtggt ggtcgaaacg geatteggea ggctcggtgc getcaattge 4 80 

tgggaacata tccagccgct cgtcaagatg tcgatgtatg cccagcatga ggaaatccat 540 

gtcgcgggtt ggecgagett ctgcgtctac cgcgatctcg cctatgccct gggaccggaa 600 

gtcaacaatg ccgtcagtca ggtctatgee gtggagggta gcgcctatgt tctggcaccc 660 

tgtgcgatcg taagccagga gatgttcgac attctggccg acaagectga aaaggccttt 720 

ctcctcaatc cccgcacatc caagcccggc ggtggcttca cgcagatcta tgegceggat 780 

ggtcgaccgc tttgegagee gcttgccgac gatgtggaag gcatcctcta tgccgatctc 840 

gatceggcaa cgatcgccgt cgcgaaggcg gccgccgatc ctgeggggea etattcgegg 900 

ccggacgcac tctcgctggt gatcaatege gaaaagcgcg cggtgatggc tgaaatcaac 960 

gcgccggcga cgccgacctt cacccccatc tccctggacg ctgcggagta g 1011 



<210> 346 
<211> 329 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 346 

Met Lys Ala He Lys Ala Ala Ala 

1 5 
Asn Leu Asp Ala Ser lie Thr Lys 
20 

Ala Ala Asn Gly Ala Lys Leu Val 

35 40 
Gly Tyr Pro Trp Phe He Trp Leu 
50 55 



Val Gin Ala Ala Pro Val Phe Leu 

10 15 
Ala Glu Thr Phe Val Ala Glu Ala 
25 30 
Ala Phe Pro Glu Thr Trp Leu Pro 
45 

Gly Ala Pro Ala Glu Gly Met Gin 
60 
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<210> 348 
<211> 297 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 348 

Met Lys Val Ala Ala He Gin Ala Ala Pro Val Tyr Leu Asp Arg Gin 

I 5 10 15 

Ala Thr Leu Glu Lys Ala Leu Ser Trp Met Lys Arg Pro Gin Thr Ala 

20 25 30 

Pro Lys Ser Ala Pro Ser Leu Arg Pro Ser Ser Arg Ala He Pro Trp 

35 40 45 

Met Asp Leu Thr Asp Gly Ala Lys Trp Asn Asp Asp Lys Gin Lys Ala 

50 55 60 

Ala Tyr Ala Cys Tyr Val Asp Ala Ala Val Glu Ala Asp Gly Pro Glu 
65 70 75 80 

Leu Gin Ala He Ala Lys Lys Ser Lys Ala Leu Gly Leu Phe Thr He 

85 90 ' 95 

Trp Ala Trp Ser Asn Ala Arg Arg Arg Pro Gly Gin Tyr He Val Pro 

10O 105 HO 

Ser Leu Pro Ser He Pro Thr Arg Val Ser Ser Ala Tyr Thr Glu Asn 

115 120 125 

Ser Cys Pro Pro Thr Pro Ser Ala Ser Cys Gly Ala Lys Ala Thr Asp 

130 135 140 

Thr Asp Ser Arg Cys Met Asn Ser Pro Ala Ser Lys Ser Ala Arg Thr 
145 150 155 160 

Ala Gly Lys He Gly Cys Pro Ala Arg Tyr Ala Met Tyr Ala Gin Gly 

165 170 175 

Glu Gin Leu His Val Ala Thr Trp Pro Gly Ser Pro Trp Leu Thr Lys 

180 185 190 

Asp He Thr Arg Phe He Ala Leu Glu Gly Arg He Tyr Val Met Ser 

195 200 205 

Val Gly Gly Val Leu Ser Ala Asn Asp He Pro Asp Ser Phe Pro Leu 

210 215 . 220 

Lys Thr Asp Leu Leu Lys He Arg Asp Arg Tyr Leu Ser Gly Gly Thr 
225 230 235 240 

Asp Ser Arg Pro Arg Arg His His Pro Arg Arg Pro Arg Gin Lys Arg 

245 250 255 

Arg Asp His Thr Leu Arg Gly Leu Asp Leu Asn Thr Val Leu Gin Glu 

260 265 270 

Arg Gin Asn Phe Asp Pro Ala Gly His Tyr Ala Arg Pro Asp Val Ser 

275 280 285 

Asn Trp Lys Ser Thr Lys He Asp Asp 
290 295 

<210> 349 
<2H> 1002 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 349 

atgacgtccc ctgttcaaac caagtacaaa gtcgcctgtg ttcaggcggc gcccgagttt 
ctcgatctcg acaaaggcgt tgccaaagcg gtgcgcctga tcgaagaagc cgccacccaa 120 
aaggcctcgc tgatcgcgtt tcccgaagtc tggctccccg gctatccgtg gtggatctgg 180 
ctcgactcgc cggcctgggg cttgcagttc gtccagcgct acttcgaaaa cgctctggtc 240 



60 
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840 
900 
960 



gtcggcagcc cccaatggga gcgcttgtgc aaggccgccg ccgacaacaa tatccatgtt 300 

gtgctcggat tctccgaacg ggacggcagc acgctgtacc tcgcacaggc catcatcgat 360 

aacaccggga aggtgatcgc cacgcggcgc aaactcaagc caacacacgc cgaacgcacg 420 

gttttcggcg aaggcgacgg gagccacatc gcggtgcatg aaaccacttt gggccgcatg 480 

ggtgcactct gctgcgccga gcacatccag ccactgacca agtacgccat gtactcgcag 540 

cacgagcaga ttcacattgc cgcatggccc agcttctcgg tctaccgcgg agcagcgttc 600 

cagctgagcg ccgaagccaa caacgccgcg agccaggtct atgccctgga gggcagttgc 660 

tacgtggtgg ccccttgcgc gacggtgtcc aaggagatgt tggacatgct ggctgattcg 720 

ccgcaaaaga agcagctcct gctggaaggc ggtggctacg ccatgattta tgggcccgac 780 
gccaagcccc tgtgcgagcc cattccagag acagaagaag gcattcttta cgcagatgtg 
gacctgggct tcatcggtgt caccaaggca gcgtatgacc ccgccggtca ctattcacgc 
cccgacgtgc tgcgcctttt gttcaatcgg aagcctgccc ctcgggttca cgatttcgat 

cctgaata'ca cggccaccga gcagaagaca gacgcggcct ga 10 02 

<210> 350 
<211> 333 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 350 

Met Thr Ser Pro Val Gin Thr Lys Tyr Lys Val Ala Cys Val Gin Ala 

15 10 15 

Ala Pro Glu Phe Leu Asp Leu Asp Lys Gly Val Ala Lys Ala Val Arg . 

20 25 30 

Leu He Glu Glu Ala Ala Thr Gin Lys Ala Ser Leu He Ala Phe Pro 

35 40 45 

Glu Val Trp Leu Pro Gly Tyr Pro Trp Trp He Trp Leu Asp Ser Pro 

50 55 60 

Ala Trp Gly Leu Gin Phe Val Gin Arg Tyr Phe Glu Asn Ala Leu Val 
65 70 75 80 

Val Gly Ser Pro Gin Trp Glu Arg Leu Cys Lys Ala Ala Ala Asp Asn 

85 90 95 

Asn He His Val Val Leu Gly Phe Ser Glu Arg Asp Gly Ser Thr Leu 

100 105 HO 

Tyr Leu Ala Gin Ala He He Asp Asn Thr Gly Lys Val He Ala Thr 

115 120 125 

Arg Arg Lys Leu Lys Pro Thr His Ala Glu Arg Thr Val Phe Gly Glu 

130 135 140 

Gly Asp Gly Ser His He Ala Val His Glu Thr Thr Leu Gly Arg Met 
145 150 155 160 

Gly Ala Leu Cys Cys Ala Glu His He Gin Pro Leu Thr Lys Tyr Ala 

165 170 175 

Met Tyr Ser Gin His Glu Gin He His He Ala Ala Trp Pro Ser Phe 

180 185 190 

Ser Val Tyr Arg Gly Ala Ala Phe Gin Leu Ser Ala Glu Ala Asn Asn 

195 200 205 

Ala Ala Ser Gin Val Tyr Ala Leu Glu Gly Ser Cys Tyr Val Val Ala 

210 215 220 

Pro Cys Ala Thr Val Ser Lys Glu Met Leu Asp Met Leu Ala Asp Ser 
225 230 235 240 

Pro Gin Lys Lys Gin Leu Leu Leu Glu Gly Gly Gly Tyr Ala Met He 

245 250 255 

Tyr Gly Pro Asp Ala Lys Pro Leu Cys Glu Pro He Pro Glu Thr Glu 

260 265 270 

Glu Gly He Leu Tyr Ala Asp Val Asp Leu Gly Phe He Gly Val Thr 

275 280 285 

Lys Ala Ala Tyr Asp Pro Ala Gly His Tyr Ser Arg Pro Asp Val Leu 
290 295 300 
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Arg Leu Leu Phe Asn Arg Lys Pro Ala Pro Arg Val His Asp Phe Asp 
305 310 315 320 

Pro Glu Tyr Thr Ala Thr Glu Gin Lys Thr Asp Ala Ala 
325 330 

<210> 351 
<211> 936 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 351 

atgattacag caggcatcgc 
gagaaagccg ttggcattat 
cccgaggcat ttattccctc 
ttcgggttgt gcgaggagtt 
gatgacctgg accctgtccg 
ttgaatgagc gcgaaggcca 
ggtccggacg gcagcatcct 
acgattcatg gcttcggcga 
gtgggtggtc tgatttgctg 
cagggcgtag aagtgtatgt 
atgcagcata ttgcccggga 
ggcagtgatt ttcccgcgga 
tgggtgaatc ccggtgggtc 
cttttccgtg aggagggcat 
aagcggtcct ttgacgtggc 
gaccgtgatc cacaggatcc 

<210> 352 

<211> 301 

<212> PRT 

<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 352 



Met 


He 


Thr 


Ala 


Gly 


He 


Ala 


Val 


Ala 


Ala 


Pro Val 


Val Leu Asp 


Lys 


1 








5 










10 




15 




Thr 


Lys 


Thr 


He 


Glu 


Lys 


Ala 


Val 


Ala 


Leu 


Phe Thr Arg Gin Arg 


Val 






20 










25 






30 




Arg 


Val 


Thr 


Cys 


Leu 


Cys 


Phe 


Pro 


Arg 


His 


Leu Phe 


Pro Pro He 


Pro 




35 








40 








45 




Pro 


Gly 


Val 


Gly 


Ala 


Cys 


Val 


Pro 


Val 


Glu 


He Ser Gly Cys Ala Arg 




50 










55 








60 






Ser 


Cys 


Thr 


Pro 


Cys 


Cys 


Leu 


He 


lie Arg 


Asn Leu 


Gin Gly Asp Asp 


65 






70 










75 




80 


Leu 


Asp 


Pro 


Val 


Arg 


Gly 


Ala 


Ala 


Ala 


Glu 


His Ser 


Met Thr Val 


Val 








85 








90 




95 




Met 


Gly 


Glu 


Ala 


Arg 


Arg 


Pro 


Val 


Arg 


Ser 


Gly Tyr 


Pro Val Arg 


His 






100 








105 






110 




Gly 


He 


Tyr 


Arg 


Ser 


Gly 


Arg 


Gin 


His 


Pro 


Glu Pro 


Ser Ser Glu 


Thr 




115 










120 








125 




Tyr 


Ala 


Asn 


Gin 


Ser 


Ala 


Tyr 


Asp 


Ser 


Trp 


Leu Arg 


Arg Cys Ala 


Gly 


130 










135 








140 






He 


Glu 


Ser 


Gly 


Gly 


Tyr 


Pro 


Val 


Arg 


Ser 


Arg Gly 


Trp Ser Asp 


Leu 


145 






150 










155 




160 


Leu 


Gly 


Glu 


Phe 


His 


Ala 


Pro 


Gly 


Ser 


Leu 


Gly Leu 


Tyr Ala Gin 


Gly 



agtcgccgct ccggtggtgt tggacaaaac aaaaaccatt 60 

tcacgaggca gcgggtaagg gtgtgaacct gcttgtgttt 120 

ctatcccgcc tggggttggc gcctgcgtcc cggtggagat 180 

gcacgccctg ttgcttgata attcggtaaa tttgcaaggt 240 

gggcgctgca gccgagcatt caatgaccgt ggtgatggga 300 

gttcggtcgg gctaccctgt ttaacgccat ggtatttatc 360 

gaaccatcat cggaaactta tgccaaccaa tcatgagcgt 420 

tgcgcgggga ttgaaagtgg tggatacccc gtgcggtcgc 480 

ggagaatttc atgcccctgg ctcgctacgg cctgtatgcc 540 

tgcgcccacc tacgaccagg gtgatgggtg gataggatcc 600 

aggacggtgc tgggtgttat cggccggaac accgctacgc 660 

catgccgggc aaggctcaac tgtttcccga tgacgatgaa 720 

agtggttatc gcaccgggtg gggaattagt ggctggaccg 780 

ccttgtctgt gaattggatc cggcgaaaag tgctcatgcc 840 

cggtcattac gccaggccag atattttcga gttggaaata 900 

cgtcgagtgg gactga 936 
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165 




170 






175 




Val 


Glu 


Val 


Tyr Val Ala 


Pro 


Thr Tyr Asp 


Gin Gly 


Asp Gly 


Trp 


He 








180 




185 




190 






Gly Ser 


Ala 


Ala Tyr Cys 


Pro 


Gly Arg Thr 


Val Leu 


Gly Val 


He 


Gly 






195 






200 




205 






Arg 


Asn 


Thr 


Ala Thr Arg 


Gin 


Phe Ser Arg 


Thr Cys 


Arg Ala 


Arg 


Leu 


210 






215 




220 








Asn 


Cys 


Phe 


Pro Met Thr 


Met 


Asn Gly He 


Pro Val 


Gly Gin 


Trp 


Leu 


225 




230 






235 






240 


Ala 


Pro 


Gly 


Gly Glu Leu 


Val 


Ala Gly Pro 


Leu Phe 


Arg Glu 


Glu 


Gly 






245 




250 






255 




lie 


Leu 


Val 


Cys Glu Leu 


Asp 


Pro Ala Lys 


Ser Ala 


His Ala 


Lys 


Arg 








260 




265 




270 






Ser 


Phe 


Asp 


Val Ala Gly His 


Tyr Ala Arg 


Pro Asp 


He Phe 


Glu 


Leu 






275 






280 




285 






Glu 


He 


Asp 


Arg Asp Pro 


Gin 


Asp Pro Val 


Glu Trp 


Asp 








290 






295 




300 









<210> 353 
<211> 1035 
<212> DNA 

<213> Psuedomonas putida ATCC 700801 
<400> 353 

atgaatgctt caaaaacaca ttataaagtt gcagctgtgc aagccgctcc tgaattcctg 60 

gatttggaca aaggggttga taaggctata cgcctcatta aagaagcagc tgataacgga 120 

gcatcgctgg ttgcctttcc ggaagtttat cttccagggt atccgtggtg gatctggctt 180 

ggctctcccg catggggaat gcagtttgtt cagcgctacg tagaaaactc gcttgattta 240 

aaaagcgagc agttcgagcg actgtgcaaa gcggctgcta cttaccgtat tcacgttgta 300 

atgggttaca gcgaacgctc gtttggcacc ctctacctcg gtcaagcaat tatcgatgac 360 

agcgggaaag taattggtac gcgccgtaag ctcaagccca cccatgctga gcgtactgtt 420 

tacggtgagg gtaacggcag tgacctcagg gttttcaatt cgcaacttgg aagggtaggt 480 

gcactctgct gcgcagagca cgtacaacca ctctcgaagt ttgcgatgta cagccagcat 540 

gagcagttgc acattgcctc ttggccgagc ttctcggtat atcgcggtgg cgcatatcaa 600 

ctgagcgctg aggctaactg cgcggccacc caagtctatg ctcttgaagg ccagtgcttc 660 

gtaatttcag catgcgcaat cgtatctaaa gacatgctaa acgttctcat cgacacccct 720 

gacaagggta acctgttgca ggatggcggt ggcttcgcga tgatttatgg ccccgatggc 780 

gcaccgctgt gtgaacccct gggcgaacat gaagaaggca tcctttatgc cgacgtcgat 840 

ttgggcgcca tttccgtagc taaagcggca ctcgacccgg ttggccatta ctcgcggcca 900 

gatgttttgc gtctgctttt caacgatcaa ccgacacctt gcgtggaggc gttcaatccg ^960 
gctcctgtcg gtactgatgc tccaggtaca gaccttcagg gtgatgagcc cgatgcgcaa 
ccgatatctg aataa 

<210> 354 
<211> 312 
<212> PRT 

<213> Psuedomonas putida ATCC 700801 
<400> 354 

Met Asn Ala Ser Lys Thr His Tyr Lys Val Ala Ala Val Gin Ala Ala 

15 10 15 

Pro Glu Phe Leu Asp Leu Asp Lys Gly Val Asp Lys Ala He Arg Leu 

20 25 30 

He Lys Glu Ala Ala Asp Asn Gly Ala Ser Leu Val Ala Phe Pro Glu 

35 40 45 

Val Tyr Ser Arg Val Ser Val Val Asp Leu Ala Trp Leu Ser Arg Met 

50 55 60 

Gly Asn Ala Val Cys Ser Ala Leu Arg Arg Lys Arg Leu He Lys Ala 
65 70 75 80 

Ser Ser Ser Ser Asp Cys Ala Lys Arg Leu Leu Leu Thr Val Phe Thr 
85 90 95 



1020 
1035 
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Leu Trp Leu Gin Arg Thr Leu Val Trp His Pro Leu Pro Arg Ser Ser 

100 105 HO 

Asn Tyr Arg Gin Arg Glu Ser Asn Trp Tyr Ala Pro Ala Gin Ala His 

115 120 125 

Pro Cys Ala Tyr Cys Leu Arg Gly Arg Gin Pro Gin Gly Phe Gin Phe 

130 135 140 

Ala Thr Trp Lys Gly Arg Cys Thr Leu Leu Arg Arg Ala Arg Thr Thr 
145 150 155 160 

Thr Leu Glu Val Cys Asp Val Gin Pro Ala Ala Val Ala His Cys Leu 

165 170 175 

Leu Ala Glu Leu Leu Gly He Ser Arg Trp Arg He Ser Thr Glu Arg 

180 185 190 

Gly Leu Arg Gly His Pro Ser Leu Cys Ser Arg Pro Val Leu Arg Asn 

195 200 205 

Phe Ser Met Arg Asn Val Ser Lys Asp Met Leu Asn Val Leu He Asp 

210 215 220 ■ 

Thr Pro Asp Lys Gly Asn Leu Leu Gin Asp Gly Gly Gly Phe Ala Met 
225 230 235 240 

He Tyr Gly Pro Asp Gly Ala Pro Leu Cys Glu Pro Leu Gly Glu His 

245 250 255 

Glu Glu Gly He Leu Tyr Arg Arg Arg Phe Gly Arg His Phe Arg Ser 

260 265 270 

Ser Gly Thr Arg Pro Gly Trp Pro Leu Leu Ala Ala Arg Cys Leu Arg 

275 280 285 

Leu Leu Phe Asn Asp Gin Pro Thr Pro Cys Val Glu Ala Phe Asn Pro 

290 295 300 

Ala Pro Val Gly Thr Asp Ala Pro 
305 310 

<210> 355 
<211> 1014 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 355 

atggccgtct ctaaagacgg tactgtttca ggaaagtcgc ctatccgatt gcatgtcgcc 60 

gcgatacaga tggtcccaaa gctgggtgac gcgcaggcga acgtgaatca ggcagaagcc 120 

cttattcgga aggctcttgg gctgggtgcg cgttggatcg tgttaccaga gatgtttacc 180 

tccggtgcgg cgtttcatcc cgacatgctc aaagccattc agccattcga tggcgcccca 240 

ctccagttgc tgaaagacct ttctcgcaag ggcaatgctg tcatcggcgg ctcgtttctc 300 

gccaagcgtg ggcaacaagt attcaatacc ttcgttttgg tttctccgga cgggtcagtc 360 

gtaacgcatg acaaggattc accgacctat tgggaaaatt gctattaccg gggcggtacc 420 

gatgatggcg tgttgtctac gcccattggc ccggtcggct ccgtcctctg ttgggaattt 480 

atccgctcaa gaaccgcgag acggctggcg aacaaggtca agatggtcgt gggaggctcc 540 

tgttggtgga cgctccccga tgatgctgat ccagacagcc cgcgcagagc cgtgaacctc 600 

aagatgctgc aagaagcgcc ggttcgcatg gcgcggatgc tgggtgttcc ggtaatacat 660 

ggctcccacg cgggcagctt cgaaggattc ttcagtccgg aacttgcgga tgttccctat 720 

aactcgacgt acctgggcga gacaatgatt gtcgacgcgg gtggccgggt acttgcccgt 780 

agagcgcaag atgcaggcga aggcgtggta acggcagaag tggttttgcc cgacaagtcc 840 

gtaccaagcg aacccatccc ggagactttc tggattccca aggaaatgcc ggatgattgg 900 

aaagaagcct gggagcgttg gttcgatacc ggtgcggatt actacgagat ggtgaccgcg 960 

ccctttatca agacgggtgt gataaacgag tacacaccgg aatatcttag gtag 1014 

<210> 356 
<211> 325 
<212> PRT 
<213> Unknown 
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<220> 

<223> Obtained from an environmental sample 

<400> 356 , ^ 

Met Ala Val Ser Lys Asp Gly Thr Val Ser Gly Lys Ser Pro lie Arg 

15 10 15 

Leu His Val Ala Ala He Gin Met Val Gin Ser Trp Val Thr Arg Arg 

20 25 30 

Arg Thr He Arg Gin Lys Pro Leu Phe Gly Arg Leu Leu Gly Trp Val 

35 40 45 

Arg Val Gly Ser Cys Tyr Gin Arg Cys Leu Pro Pro Val Arg Arg Phe 

50 55 60 

He Pro Thr Cys Ser Lys Pro Phe Ser Phe Asp Gly Ala Pro Leu Gin 
65 70 75 80 

Leu Leu Lys Asp Leu Ser Arg Lys Gly Asn Ala Val He Gly Gly Ser 

85 90 95 

Phe Leu Gin Ala Trp Ala Thr Ser He Gin Tyr Leu Arg Phe Gly Phe 

100 105 HO 

Ser Gly Arg Val Ser Arg Asn Ala Gin Gly Ser Pro Thr Tyr Trp Glu 

115 120 125 

Asn Cys Tyr Tyr Arg Gly Gly Thr Asp Asp Gly Val Leu Ser Thr Pro 

130 135 140 

He Gly Pro Ser Ala Pro Ser Ser Val Gly Asn Leu Ser Ala Gin Glu 
145 150 155 160 

Pro Arg Asp Gly Trp Arg Thr Arg Ser Arg Trp Val Gly Gly Ser Cys 

165 170 175 

Trp Trp Thr Leu Pro Asp Asp Ala Asp Pro Asp Ser Pro Arg Arg Ala 

180 185 190 

Val Asn Leu Arg Cys Cys Lys Lys Arg Arg Phe Ala Trp Arg Gly Cys 

195 200 205 

Trp Val Phe Arg Tyr Met Ala Pro Thr Arg Gin Leu Arg Arg He Leu 

210 215 220 

Gin Ser Gly Thr Cys Gly Cys Ser Leu Leu Asp Val Pro Gly Arg Asp 
225 230 235 240 

Asn Asp Val Asp Ala Gly Gly Arg Val Leu Ala Arg Arg Ala Gin Asp 

245 250 255 

Ala Gly Glu Gly Val Val Thr Ala Glu Gly Phe Ala Arg Gin Val Arg 

260 265 270 

Thr Lys Arg Thr His Pro Gly Asp Phe Leu Asp Ser Gin Gly Asn Ala 

275 280 285 

Gly He Gly Lys Lys Pro Gly Ser Val Gly Ser He Pro Val Arg He 

290 295 300 

Thr Thr Arg Trp Pro Arg Pro Leu Ser Arg Arg Val Thr Ser Thr His 
305 310 315 320 

Arg Asn He Leu Gly 
325 

<210> 357 
<211> 951 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



60 
120 



<400> 357 

atgacaaaac tagccatcgt acaaaaaccg ccagtctttc tggataagca aaaaaccatt 
gaqctggccg tcgccaacat tgaagaggcc gccgccaagg gtgccgatct cgtggtgttt 
tctgaagctt tcattcccgg ctatcctgcc tggatctggc gtctacgccc cggcggtgac 180 
tgggggcttt cagaagagtt gcaccagcgt ttgctgcgca atgccgtcaa tgtggactcc 240 
gatgatctgg ctccgttgtt tgaggtcgcc cgcaagcacg aactcaccat cgtttgcggt 300 
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atcgaggagc gtgacaacaa actaagtcaa acaaccttat ataacaccgt catcacgatt 
ggtcccgatg gatcgttact gaacaaacat cgcaagctta tgcccaccaa cccggagcga 
atggtgtggg ggtttggtga cgcatccggt ttaaaagtcg ttgataccaa tgctggtcga 
attggctcat taatgtgctg ggaaaattac atgccgctgg ctcgctatgc cctatatgca 
caaggtgtcg agatctatat cgcaccgacc tacgacagcg gtgatggctg gataggcagc 
atgcagcaca tcgcacgtga agggggctgt tgggtggtgg gatgtgggtg tctcatgaaa 
ggcagtgata ttccagatga tttcccggag aaatccacgt tgtatccaga tgcagatgaa 
tgggtgaacc cgggtgattc tgtagtgata gcacccggcg gtgaaattat ggccggccca 
atgaacagag agtccggtat tttgtatcac gagctagaca gagaaaaagt cagcaacgct 
aaacgagcat tcgatgttgc cgggcattat tcacgtcccg atatctttca gctgcatgta 
aatacacagg agcagtcacc ctgcgtattc gaaaataatt ccataactta a 

<210> 358 
<211> 300 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 358 

Met Thr Lys Leu Ala He Val Gin Lys Pro Pro Val Phe Leu Asp Lys 

1 5 10 15 

Gin Lvs Thr He Glu Leu Ala Val Pro Thr Leu Lys Arg Pro Pro Pro 

20 25 30 

Arg Val Pro He Ser Trp Cys Phe Leu Lys Leu Ser Phe Pro Ala He 

35 40 45 

Leu Trp He Trp Arg Leu Arg Pro Gly Gly Asp Trp Gly Leu Ser Glu 

50 55 60 

Glu Leu His Gin Arg Leu Leu Arg Asn Ala Val Asn Val Asp Ser Asp 
65 70 75 80 

Asp Leu Ala Pro Leu Phe Glu Val Ala Arg Lys His Glu Leu Thr He 

85 90 95 

Val Cys Gly Arg Gly Ala Gin Gin Thr Lys Ser Asn Asn Leu He His 

100 105 U0 

Arg His His Asp Trp Ser Arg Trp He Tyr Thr Asn He Ala Ser Leu 

• 115 120 125 

Cys Pro Pro Thr Arg Ser Glu Trp Cys Gly Gly Leu Val Thr His Pro 

130 135 140 

Leu Lvs Val Val Asp Thr Asn Ala Gly Arg He Gly Ser Leu Met Cys 
145 150 155 160 

Trp Glu Asn Tyr Met Pro Leu Ala Arg Tyr Ala Leu Tyr Ala Gin Gly 

165 170 175 

Val Glu He Tyr He Ala Pro Thr Tyr Asp Ser Gly Asp Gly Trp He 

180 185 190 

Glv Ser Ala Ala His Arg Thr Arg Gly Leu Leu Gly Gly Gly Met Trp 

195 200 205 

Val Ser His Glu Arg Gin Tyr Ser Met He Ser Arg Arg Asn Pro Arg 

210 215 220 

Cys He Gin Met Gin Met Asn Gly Thr Arg Val He Leu Thr Arg Arg 
225 230 235 240 

Asn Tyr Gly Arg Pro Asn Glu Gin Arg Val Arg Tyr Phe Val Ser Arg 

245 250 255 

Ala Arg Gin Arg Lys Val Ser Asn Ala Lys Arg Ala Phe Asp Val Ala 

260 265 270 

Gly His Tyr Ser Arg Pro Asp He Phe Gin Leu His Val He His Arg 

275 280 285 

Ser Ser His Pro Ala Tyr Ser Lys He He Pro Leu 
290 295 300 

<210> 359 



360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
951 
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<211> 1029 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 359 

atggcaaacg 

ggtacggtag 

tgcgttttcc 

gcggccatgg 

agcacgaacg 

aatgaacgcg 

ctcgtgcaac 

ggtgatgggt 

tgttgggaac 

cacgtcgcca 

accattcgcc 

agcaaggaac 

ggttgttaca 

ggcgaaggca 

atggatagcg 

ccaacccaca 

aatacacgaa 

cgcgagtaa 



tcgttcgtgc 
agaaggtagt 
cggagacggt 
gcaaagacca 
cgattgccgc 
atcacggtac 
ggcgtcgcaa 
cgggtttgcg 
attacaatcc 
tgtttccggg 
accacgcgct 
aggtcgccca 
ccgcgattgt 
tggtcgtggc 
tcgggcacta 
cggcggtcga 
caccaaagcg 



tgcggccgta 
tgctgcgatc 
tgttccctat 
catgcagctg 
ggcggccaaa 
gatatacaac 
gataaccccc 
ctgtgtcgac 
cttggcgcgc 
ttcgatggtg 
cgaaagcggc 
gttgtcacaa 
atcgcctgaa 
ggatatggat 
cagtcgcccg 
cgtcgaattc 
cacaactggc 



cagttgagcc 
cgtgacgccg 
tatccgtatt 
tacgagcaag 
caacactcga 
acgcagttgt 
acgttccacg 
acacaaatcg 
tacgcattga 
ggtcagatct 
tgtttcgtcg 
ggcacgtcgc 
ggcgtcgtac 
ctcagcctca 
gaattgctgt 
aactccaatt 
ccacgttcga 



ctgttttggg 
cctcgcaggg 
tctcgttcat 
ctgtggtcgt 
tcgtcgtttc 
tgttcgatgc 
agcgtatggt 
ggcgcatcgg 
tggccgatca 
tcgccgatca 
tcaacgctac 
tcgacgcggc 
tgggcgaacc 
tcaccaaacg 
ctctgctgat 
ccgagtctca 
accttcaagt 



tagtcgcgag 
cgcacagctg 
tcggccgccc 
gccttctccc 
aatcggcgtc 
cgacgggaca 
gtggggtcaa 
tagcctggct 
cgaagagatc 
aattcaggta 
ggggtatctg 
gttgaccggt 
gctcaccgac 
caaacgcatg 
caatcgaacg 
tcatgtcagc 
tgccgctgat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1029 



<210> 360 
<211> 335 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 360 

Met Ala Asn Val Val Arg Ala Ala Ala Val Gin Leu Ser Pro Val Leu 

1 5 10 15 

Gly Ser Arg Glu Gly Thr Val Glu Lys Val Val Ala Ala He Arg Asp 

20 25 30 

Ala Ala Ser Gin Gly Ala Gin Leu Cys Val Phe Pro Glu Thr Val Val 

35 40 45 

Pro Tyr Ser Val Phe Leu Val His Ser Ala Ala Arg Gly His Gly Gin 

50 55 60 

Arg Pro His Ala Ala Val Arg Ala Ser Cys Gly Arg Ala Phe Ser Gin 
65 70 75 80 

His Glu Arg Asp Cys Arg Gly Gly Gin Thr Thr Leu Asp Arg Arg Phe 

85 90 95 

Asn Arg Arg Met Asn Ala He Thr Val Arg Tyr Thr Thr Arg Ser Cys 

100 105 HO 

Cys Ser Met Pro Thr Gly His Ser Cys Asn Gly Arg Lys He Thr Pro 

115 120 125 

Thr Phe His Glu Arg Met Val Trp Gly Gin Gly Asp Gly Ser Gly Leu 

130 135 140 

Arg Cys Val Asp Thr Gin He Gly Arg He Gly Ser Leu Ala Cys Trp 
145 150 155 160 

Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Leu Met Ala Asp His Glu 

165 170 175 

Glu lie His Val Ala Met Phe Pro Gly Ser Met Val Gly Gin He Phe 

180 185 190 

Ala Asp Gin He Gin Val Pro Phe Ala Thr Thr Arg Ser Lys Ala Ala 
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1 OR 






200 








205 








VOX OcL 


Ser 


Ser 


Thr 


Leu Arg Gly 


He 


Ala 


Arg Asn 


Arg 


Ala 


Gin 


Leu 










215 






220 










ser bin 


oiy 




Ser 


Leu Asp Ala 


Ala 


Leu 


Thr Gly Gly 


Cys 


Tyr 


Thr 








230 






235 








240 


Ala He 


Val 


Ser 


Pro 
245 


Glu Gly Val 


Val 


Leu 
250 


Gly Glu 


Pro 


Leu 


Thr 
255 


Asp 


Gly Glu 


Gly 


Met 


Val 


Val Ala Asp 


Met 


Asp 


Leu Ser 


Leu 


He 


Pro 


Asn 


260 






265 








270 






Ala Asn 


Ala 


Trp 


He 


Ala Ser Gly 


Thr 


Thr 


Val Ala 


Arg 


Asn 


Cys 


Cys 




275 




280 








285 








Leu Cys 


Ser 


He 


Thr 


Pro Thr His 


Thr 


Ala 


Val Asp 


Val 


Glu 


Phe 


Asn 


290 








295 






300 








Ala 


Ser Asn 


Ser 


Glu 


Ser 


His His Val 


Ser 


Asn 


Thr Arg 


His 


Gin 


Ser 


305 








310 






315 








320 


Gin Leu 


Ala 


His 


Val 
325 


Arg Thr Phe 


Lys 


Leu 
330 


Pro Leu 


He 


Ala 


Ser 
335 





<210> 361 
<211> 951 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 

<400> 361 cn 

atgctaaacg agaacaatgg cgctactttc aaggttgctg ccgtgcaggc ttcaccggta 60 

tttcttgatc gcgctgctac aatcgacaag gcttgcgatt taattgctac ggccggacgc 120 

gagggggctc gcctgatcgt ctttccagaa gcgttcgtcc cggcctatcc tgattgggta 180 

tgggcgattc ccgcgggtga tgagggcatg ctcaatgagc tgtatgcaga attacttgcc 240 

aatgctgtca ccattcccag cgatgcgacc gagaggttgt gtcgcgcggc gcggcttgct 300 

aatgcttacg tggtgatggg gatgagcgaa cgcaatgccg aagcgagtgg cgccagcctg 360 

tataatacgc tgttgtatat tgatgcacag ggacaaatcc tgggaaagca ccggaagctg 420 

gttccaacgg gcggcgagcg cctggtatgg gcacagggag atggcagcac cctggaggtt 480 

tacgatactc ctttgggaaa actcggtggc ttaatctgct gggagaatta tatgccgctg 540 

gcacgctata ctatgtatgc ctggggcacg caaatctaca ttgcagcgac gtgggatcgc 600 

gggcagccat ggctatccac tttgcgacac attgctaaag agggcagagt atatgtgatc 660 

ggctgttgta ttgctatgcg caaagatgat atccccgacc attacgcgat gaaggagaag 720 

tattacgcgg aagaagacga gtggatcaat attggcgata gcgcaatcgt caatccagaa 7 80 

ggggtattta ttgccgggcc agtgcgtaag caagaagaaa tcctctacgc cgaggttgac 840 

ccgcgaatga tgcaggggcc aaagtggatg ctcgacgtgg caggacatta cgcgcgcccg 900 

gatgtattcc agttgacggt gcacacggag aggcggcaga tgatccacta g 951 

<210> 362 

<211> 302 

<212> PRT 

<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 362 

Met Leu Asn Glu Asn Asn Gly Ala Thr Phe Lys Val Ala Ala Val Gin 

I 5 10 15 

Ala Ser Pro Val Phe Leu Asp Arg Ala Tyr Asn Arg Gin Gly Leu Arg 

20 25 30 

Phe Asn Cys Tyr Gly Arg Thr Arg Gly Gly Ser Pro Asp Arg Leu Ser 

35 40 45 

Arg Ser Val Arg Pro Gly Leu Ser Leu Gly Met Gly Asp Ser Arg Gly 
50 55 6( > 
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Glv 


His 


Ala 


Gin 


Ala Vai Cys 


Glu 


Leu 


Leu 


Ala Asn 


Ala 


Vai Thr 


He 










70 








75 






80 


Pro 


Ser 


Asp Ala 


Thr Glu Ara 


Leu 


Cys 


Arg 


Ala Ala 


Arg 


Leu Ala 


Asn 










O J 






90 






95 




Ala 




vai 


vai 


Met Glv Met 


Ser 


Glu 


Arg 


Asn Ala 


Glu 


Ala Ser 


Glv 








1UU 






105 








110 




Ala 
nla 




Leu 


Tyr 


noil l lis. i-iC u 


Leu 


iy t 


Cys 


Thr Gly 


Thr 


A^n Pro 

noil v 


Glv 






115 














1 9 5 






Lys 


nla 


Pro 


Glu 


nXd cly Oct 


Asn 


Gly 


g 


Arg AJ.3 


Pro 


wly lie 


Gly 


























Thr 


Gly 


Arg 


Trp 


f^l n H*i q Prn 
uxii nio xr i. u 


Gly 


Gly 


Leu 


Arg Tyr 


Ser 


Phe Gly 


Lys 


145 








i ^n 

J. JV 








i 






160 


Thr 


Arg 


Trp 


Leu 


JX^n TtPii T.P»n 

noil -DC U XJCU 


Arg 


lie 


lie 


Cys Arg 




His Ala 


He 
















1 IU 






175 




Leu 


Cys 


Met 


Pro 


uiy nid mi. y 


Lys 




Thr 


T.oi-j fzl n 




Arg Gly 


He 








180 






185 








190 




Ala 


Gly 


Ser 


His 


G1 w Tur Pit* 


Leu 


Cys 


Asp 


Thr Leu 

1. 11JL UvU 


Leu 


Lv<? Arcr 

*4y O Hi %j 


Ala 






195 














& V/J 






Glu 


Tyr 


Met 


Ser 


nid vai vcix 


Leu 




Ala 


Hi n A to 

vj jlii ni. y 




Pro Arg 


Pro 




210 






91 ^ 








990 








Leu Airg 


Asp Glu 


Gly Glu Vai 


Leu 


Arg 


Gly 


Arg Arg 


Arg 


Vai Asp 


Gin 


225 








230 








235 






240 


Tyr 


Trp 


Arg Arg 


Asn Arg Gin 


Ser 


Arg 


Arg 


Gly He 


Tyr 


Cys Arg 


Ala 










245 






250 






255 




Ser 


Ala 


Ala 


Arg 


Arg lie Leu 


Tyr 


Ala 


Glu 


Vai Asp 


Pro 


Arg Met 


Met 








260 






265 








270 




Gin 


Gly 


Pro 


Lys 


Trp Met Leu 


Asp 


Vai 


Ala 


Gly His 


Tyr 


Arg Ala 


Arg 






275 






280 








285 






Met 


Tyr 


Ser 


Ser 


Arg Cys Thr 


Arg 


Arg 


Gly 


Gly Arg 


Ser 


Thr 






290 






295 








300 









<210> 363 
<211> 1053 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 
<400> 363 

atgccgaaaa agtcgaccgt ccgggtcgca gccgtccaga ttgcgccgga tctgacatcg 60 

egggaaaaga cggtggcacg cgtgatcgag gcgatcgccc aggcatccgc caaaggtgcg 120 

gagcttgtgg tttttcccga gacctttgtg ccgtggtatc cttatttctc gttcgtgttg 180 

cccccggtct tgtcgggcaa ggagcacctg cggctctacg aagaggcggt tgcggtgcca 240 

agtgccgcca caagaagegt agcggctgcc gctcgcgaac atggcatcgt cgtggcgctt 300 

ggcgtcaacg agegegacta tggcacgctc tacaataege aactgetttt egatgecgat 360 

ggcagtctga tectgaageg gegcaagate accccgactt tccacgagcg gatgatctgg 420 

ggccagggcg atgectcagg cctgaaggtt gtcgacagcg ccattggccg catcggcgcg 480 

ctggcctgct gggaacacta caatccgcta gcccgctatg cgctgatggc gcagcacgag 540 

gaaatccaca ttgcgcagtt tcccggctcc atggtcgggc egatctttge cgatcagatg 600 

gaggtgacga tccgccatca cgcgctggaa ageggctget tegtegtcaa tgccacggga 660 

tggctgaegg atgatcagat cgtctcgatc acaceggata ccggcctgca aaaagcgctg 720 

cggggtggct geatgaegge gatcatttcc cccgaaggca ageatctegt gccgccgctc 780 

accgaaggtg agggtatcct cgtcgccgat ctcgacatga gcctcattct caagegcaag 840 

cgcatgatgg atteggtegg ccactatgcc cggcccgagt tgctgcacct cgtcatggac 900 

gcccggccgg ctgcgccgat gagggaatcg tccatgccca ctgccttccc eggegaaaca 960 

ttgacaaccg acatgaccga tggagaacag gatgegtett tegaeggaaa cgctgatcaa 1020 

egaattgeag tectteggag cccggctggt tga 1053 

<210> 364 
<211> 335 
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<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 364 



Met 


Pro 


Lys 


Lys 


Ser 


Thr 


Val 


Arc 


Val 


Ala 


Ala 


Val 


Gin 


He Ala 


Pro 


1 








5 










10 








15 




Asp 


Leu 


Thr 


Ser 


Arg 


Glu 


Lys 


Glv 


Gly Thr 


Arg 


Asp 


Arg 


Gly Asp 


Arg 








20 










25 










30 




Pro Gly 


He 


Arg 


Gin 


Arg 


Cys 


Glv 


Ala 


Cys 


Glv 


Phe 


Ser 


Arg Asp 


Leu 






35 










40 










45 






Cys 


Arg 


Gly 


lie 


Leu 


He 


Ser 


Arg 


Ser 


Cys 


Cys 


Pro 


Am 


Ser Cys 


Arg 




50 










55 










60 








Ala 


Arg 


Ser 


Thr 


Cys 


Gly 


Ser 


Thr 


Lys 


Arg 


Arg 


Leu 


Arg 


Cys Gin 


Val 


65 










70 










75 








80 


Pro 


Pro 


Gin 


Glu 


Ala 


Arg 


Leu 


Pro 


Leu 


Ala 


Asn 


Met 


Ala 


Ser Ser 


Tro 










85 










90 








95 




Arg 


Leu 


Ala 


Ser 


Thr 


Ser 


Ala 


Thr 


Met 


Ala 


Arg 


Ser 


Thr 


He Arg 


Asn 








100 










105 










no- 




Cys 


Phe 


Ser 


Met 


Pro 


Met 


Ala 


Val 


Ser 


Ser 


Glv 


Ala 


Arcr 


Ser Pro 


Ara 






115 










120 










125 






Leu 


Ser 


Thr 


Ser 


Gly 


Ser 


Gly 


Ala 


Arg 


Ala 


Met 


Pro 


Gin 


Ala Glu 


Glv 




130 










135 










140 








Cys Arg 


Gin 


Arg 


His 


Trp 


Pro 


His 


Arg Arg 


Ala 


Glv 


Leu 


Leu Glv 


Thr 


145 










150 










155 








160 


Leu 


Gin 




Ala 


Ser 


Pro 


Tyr 


Ala 


Leu 


Met 


Ala 


Gin 


His 


Glu Glu 


He 










165 










170 








175 




His 


He 


Ala 


Gin 


Phe 


Pro Gly 


Ser 


Met 


Val 


Gly 


Pro 


He 




Asd 








180 










185 










190 




Gin 


Met 


Glu 


Val 


Thr 


He Arg 


His 


His 


Ala 


Leu 


Glu 


Ser 


Gly Cys 


Phe 






195 










200 










205 






Val 


Val 


Asn 


Ala 


Thr 


Gly Trp 


Arg 


Met 


He 


Arg 


Ser 


Ser 


Arg Ser 


His 




210 










215 










220 








Arg 


He 


Pro 


Ala 


Cys 


Lys 


Lys 


Arg 


Cys Gly 


Val 


Ala 


Ala 


Gly Asp 


His 


225 










230 










235 








240 


Phe 


Pro 


Arg 


Arg Gin 


Ala 


Ser 


Arg 


Ala 


Ala 


Ala 


His 


Arg 


Arg Gly 


Tyr 










245 










250 








255 




Pro 


Arg 


Arg 


Arg 


Ser 


Thr 


Ala 


Ser 


Phe 


Ser 


Ser 


Ala 


Ser 


Ala Trp 


He 








260 










265 










270 




Arg 


Ser 


Ala 


Thr 


Met 


Pro 


Gly 


Pro 


Ser 


Cys 


Cys 


Thr 


Ser 


Ser Trp 


Thr 






275 










280 










285 






Pro 


Gly 


Arg 


Leu Arg 


Arg Gly 


Asn 


Arg 


Pro 


Cys 


Pro 


Leu 


Pro Ser 


Pro 




290 










295 










300 








Ala 


Lys 


He 


Asp Asn 


Arg 


His 


Asp 


Arg 


Trp 


Arg 


Thr 


Gly 


Cys Val 


Phe 


305 










310 










315 








320 


Arg Arg 


Lys 


Arg 


Ser 


Thr 


Asn 


Cys 


Ser 


Leu 


Arg 


Ser 


Pro 


Ala Gly 





325 330 335 



<210> 365 
<211> 975 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 365 

atgacaaagc tagccatcgt tcaaaaaccc cccgtctttc tggataaaga aaaaaccata 60 
gcgaagacgg ttgattccat aaaagaggcc gcgacacaaa atgccgactt ggtcatcttc 120 



Page 238 



WO 03/097810 PCTAJS03/15712 



accgaagcct tcatcccggg ctaccccacc tggatatggc gacttaggcc aggcgctgat 180 

tggggcctct cagaagagct gcacgagcag ttattgcgta acgcggtgag tatgggatcg 240 

accgacctgg atccgcttta tgaagccgcc caacagcata acgtcactat tgtttgcggc 300 

atcgtagaaa gagaccacca actcagccaa tcaaccctct acaacagcat ggtcgtcatt 360 

gacacagacg gaacccttct caacaagcac cgcaaactca tgccaactaa tcccgaacgc 420 

atggtgtggg gctttggcga cgcctcagga ctcaaagccg ttgcaacacc tgcaggccgc 480 

atcagcacgt tgttgtgttg ggagaactac atgccattgg cccgatatgc tctgtatgca 540 

caaggcgtgg aaatctatat cgcgccaact tacgacagtg gtgcgggttg gataggaagc 600 

ttgcaacaca tagcacgcga aggtcgatgc tgggtcgtgg gctgtggcaa cctgattcag 660 

gccagtgatc tgcctgaaga cttcccggac aaggacaacc tctacccgga cgcagaagag 720 

tgggtgaacc cgggtgactc catagtcatt gcaccagacg gtgagattgt ggccggtcca 780 

atgcacaaag agacaggaat tttgtactgc gagatagatc tggagaaagt cagaattgca 840 

aaacgagcat tagacgtgac cgggcattat tcgcgaccgg acgttttcaa actgcatgtg 900 

aatacccgac ctcaatcacc tgtggaattt gaaggtcagg agaccaacaa tccaacaaca 960 

ggagaaagct catga 975 

<210> 366 
<211> 315 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 366 












Met 


Thr 


Lys 


Leu Ala He Val Gin 


Lys 


Pro 


Pro Val Phe Leu Asp 


Lys 


1 




5 




10 


15 




Glu 


Lys 


Thr 


He Ala Ly3 Thr Val 


He 


Pro 


Lys Arg Pro Arg His 


Lys 






20 


25 




30 




Met 


Pro 


Thr 


Trp Ser Ser Ser Pro 


Lys 


Pro 


Ser Ser Arg Ala Thr 


Thr 






35 


40 






45 




Trp 


He 


Trp 


Arg Leu Arg Pro Gly 


Ala 


Asp 


Trp Gly Leu Ser Glu 


Glu 




50 




55 






60 




Leu 


His 


Glu 


Gin Leu Leu Arg Arg 


Gly Glu 


Tyr Gly He Asp Arg 


Pro 


65 






70 






75 


80 


.Gly 


Ser 


Ala 


Leu Ser Arg Pro Thr 


Ala 


Arg 


His Tyr Cys Leu Arg 


His 






85 




90 


95 




Arg 


Arg 


Lys 


Arg Pro Pro Thr Gin 


Pro 


He 


Asn Pro Leu Gin Gin 


His 








100 


105 




110 




Gly Arg 


His 


His Arg Arg Asn Pro 


Ser 


Gin 


Gin Ala Pro Gin Thr 


His 






115 


120 






125 




Ala 


Asn 


Ser 


Arg Thr His Gly Val 


Gly Leu 


Trp Arg Ala Ser Gly Leu 




130 




135 






140 




Lys 


Ala 


Val 


Ala Thr Pro Ala Gly 


Arg 


He 


Ser Thr Leu Leu Cys 


Trp 


145 






150 






155 


160 


Glu 


Asn 


Tyr 


Met He Gly Pro He 


Cys 


Ser 


Val Cys Thr Arg Arg Gly 






165 




170 


175 




Asn 


Leu 


Tyr 


Arg Ala Asn Leu Arg 


Gin 


Trp 


Cys Gly Leu Asp Arg Lys 






180 


185 




190 




Leu 


Ala 


Thr 


His Ser Thr Arg Arg 


Ser 


Met 


Leu Gly Arg Gly Leu 


Trp 






195 


200 






205 




Gin 


Pro 


Asp 


Ser Gly Ser Asp Leu 


Pro 


Glu 


Asp Phe Pro Asp Lys 


Asp 




210 


215 






220 




Asn 


Leu 


Tyr 


Pro Asp Ala Glu Glu 


Trp Val 


Asn Pro Gly Asp Ser 


He 


225 




230 






235 


240 


Val 


He 


Ala 


Pro Asp Gly Glu He 


Val 


Ala 


Gly Pro Met His Lys 


Glu 








245 




250 


255 




Thr 


Gly 


He 


Leu Tyr Arg Asp Arg 


Ser Gly 


Glu Ser Gin Asn Cys 


Lys 






260 


265 




270 




Thr 


Ser 


He 


Arg Arg Asp Arg Ala 


Leu 


Phe 


Ala Thr Gly Arg Phe 


Gin 






275 


280 






285 
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Thr Ala Cys Glu Tyr Pro Thr Ser lie Thr Cys Gly He Arg Ser Gly 

290 295 300 

Asp Gin Gin Ser Asn Asn Arg Arg Lys Leu Met 
305 310 315 

<210> 367 
<211> 981 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 367 

atgactgccc actttgccga taccctgacc gttgccgtgg cccagatcgc gccggtgtgg 60 

ctacagcgcg aagctaccct gcaaaagatg ctggtctggg tggagcgcgc cgcgcgcgaa 120 

ggcgcgggcc tggtcgcctt cagtgagggc ctgctgcctg gctacccctt ctggattgaa 180 

cacacggacg gcgcccgctt cgagtcgccg ttacagaagc gcctgtacgc ccactactgc 24 0 

gatcagtcgg tccagatcaa tgctgggcac ctcgcaccac tctgcgctgc cgcggccaga 300 

caccagattt gggtggtctg cggcgtcatc gagcgcgaca gcgcacgcgg actcagcgtg 360 

tttgcatcaa tggtcaccat cgatgccgaa ggcgcgatcc gcagtgtgca ccgcaagctg 420 

atgccgacct acgaagaacg cctggtgtgg tcgcccggcg acgcgcacgg actgcgctgc 480 

catccgctcg gccagttccg cctcggcagc ctcaattgct gggagaactg gatgccgctg 540 

gcgcgcgccg ccctgtacgc ccagggcgag tctttgcatg ttgcatcctg gcccggcagt 600 

cgccgcaaca ccgagaccat tactcccttc atcgcccgcg aaggccgcag ttacgcgctg 660 

tccgccagtt ccgtgctgca ccgggatgat ctgcccgact ccgttccggc gctgtcggtg 720 

ctgcgcgact gcctgccgga cgtgatggcc gacggcggct cctgcgtcgc cggccccgac 780 

ggacatttcc tgatcgagcc ggtcgtcggc cgggaagagc tgctgctcgc gcagatcgat 840 

catgcccggg tacgcgagga acgtcagaac ttcgacccct tcggccacta ctcgcggccg 900 

gaactcctgt cgctggtggt ggatacgcgc cgggcgagcg gagtgcagat agtgaatgct 960 

gaccatggct ttaagccctg a 981 

<210> 368 

<211> 317 

<212> PRT 

<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 368 



Met 


Thr 


Ala 


His 


Phe 


Ala 


Asp 


Thr 


Leu 


Thr 


Val 


Ala 


Val 


Ala 


Gin 


He 


1 








5 










10 










15 




Ala 


Pro 


Val 


Trp 
20 


Leu 


Gin 


Arg 


Glu 


Tyr 
25 


Pro 


Ala 


Lys 


Asp 


Ala 
30 


Gly 


Leu 


Gly 


Gly 


Ala 
35 


Arg 


Arg 


Ala 


Arg 


Arg 
40 


Arg 


Gly 


Pro 


Gly 


Arg 
45 


Leu 


Gin 


Gly 


Pro 


Ala 
50 


Ala 


Trp 


Leu 


Pro 


Leu 
55 


Leu 


Asp 


Thr 


His 


Gly 
60 


Arg 


Arg 


Pro 


Leu 


Arg 


Val 


Ala 


Val 


Thr 


Glu 


Ala 


Cys 


Thr 


Pro 


Thr 


Thr 


Ala 


He 


Ser 


Arg 


65 










70 










75 










80 


Ser 


Arg 


Ser 


Met 


Leu 


Gly 


Thr 


Ser 


His 


His 


Ser 


Ala 


Leu 


Pro 


Ala 


Arg 








85 










90 










95 




His 


Gin 


He 


Trp 
100 


Val 


Val 


Cys 


Gly 


Val 
105 


He 


Glu 


Arg 


Asp 


Ser 
110 


Ala 


Arg 


Gly 


Leu 


Ser 


Val 


Phe 


Ala 


Gin 


Trp 


Ser 


Pro 


Ser 


Met 


Pro 


Lys 


Ala 


Arg 




115 










120 










125 








Ser 


Ala 
130 


Val 


Cys 


Thr 


Ala 


Ser 
135 


Cys 


Arg 


Pro 


Thr 


Lys 
140 


Asn 


Ala 


Trp 


Cys 


Gly 


Arg 


Pro 


Ala 


Thr 


Arg 


Thr 


Asp 


Cys 


Ala 


Ala 


He 


Arg 


Ser 


Ala 


Ser 



145 150 155 160 
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Ser Ala 


Ser 


Ala Ala Gin Leu 


Leu Gly Glu Leu Asp 


Ala 


Ala 


Gly 


Ala 






165 


170 






175 




Arg Arg 


Pro 


Val Arg Pro Gly 


Arg Val Phe Ala Cys 


Cys 


He 


Leu 


Ala 






180 


185 




190 






Arg Gin 


Ser 


Pro Gin His Arg 


Asp His Tyr Ser Leu 


His 


Arg 


Pro 


Arg 




195 




200 


205 








Arg Pro 


Gin 


Leu Arg Ala Ser 


Ala Ser Ser Val Leu 


His 


Arg 


Asp 


Asp 


210 




215 


220 










Leu Pro 


Asp 


Ser Val Pro Ala 


Leu Ser Val Leu Arg 


Asp 


Cys 


Leu 


Arg 


225 




230 


235 








240 


Thr Trp 


Pro 


Thr Ala Ala Pro 


Ala Ser Pro Ala Pro 


Thr 


Asp 


He 


Ser 




245 


250 






255 




Ser Ser 


Arg 


Ser Ser Pro Gly 


Arg Ala Ala Ala Arg 


Ala 


Asp 


Arg 


Ser 






260 


265 




270 






Cys Pro 


Gly 


Thr Arg Gly Thr 


Ser Glu Leu Arg Pro 


Leu 


Gly 


His 


Tyr 




275 




280 


285 








Ser Arg 


Pro 


Glu Leu Leu Ser 


Leu Val Val Asp Thr 


Arg 


Arg 


Ala 


Ser 


290 




295 


300 










Gly Val 


Gin 


He Val Asn Ala 


Asp His Gly Phe Lys 


Pro 








305 




310 


315 











<210> 369 
<211> 1074 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 
<400> 369 

atgtccgaag agaacaccac caccgaatca agctcggacg cgatctggac cgttgcggcc 60 

gtccaggcgg cgccggtgtt cttgaatcgg gatgcgacgg tgcgtaaggc cgtctcgctg 120 

attgccgagg ccgcggcgca cggcgcgcgt ctcattgttt ttcccgaggc gttcattcca 180 

tcgtacccgg attgggcgtg ggcggtcccg cccggacagg gcggaaccaa ctcgagactg 240 

tatgccaaac tgctcgacaa ttcggttacg gtgcccagcc cagccaccga tgccctggcc 300 

agggcggctc gcgacgcggg cgcctacgtc gtcatgggca taaacgagcg gaacacggcg 360 

gcgagcggcg gaagtctcta caacagcttg ctctatattg gtccggacgg tcgcatcctg 420 

ggcattcatc ggaagttggt gccgacctcg gcggagcggc tgatctgggc acaaggcgat 480 

ggaagcacgc ttggcgtgtt cgatacgccg ggcggccggc ttggcggctt gatctgctgg 540 

gaaaactata tgccgctggc ccgatattcg atgtacgcgc gcggcgtgca aatatatgtt 600 

gcggcgacct gggacagggg cgagccatgg ctttcgacgc tgcgacacat agccaaggaa 660 

ggccagacct acgtgatagg ctgttgcatc gccatgcgga cggccgacat cgacgacgcc 720 

gagctcgtcg agaagtatta cgcggacgcc ggtgagtgga tcaatgaagg cgacagcgcg 780 

attgtcgatc cgagcggaac aatcattgcc ggtccggccc atcagaccaa tgaaatcctc 840 

tacgccgcga tcgatcgcca gaaggtgctg gaatcaaaat ggatgttgga cgtggccggg 900 

cactacgcgc gtccggacgt gttttcattt ggcgttcgaa ccgatgccaa cccgataatg 960 

accatgaacg aaccaagcgc gacggccgag ccgaggcaca atagcgcggg agccgagggt 1020 

cgcgacggcc tacgcgggcg tcgtgaccct cgctcgagaa ttcggcagat gtag 1074 

<210> 370 
<211> 346 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 370 

Met Ser Glu Glu Asn Thr Thr Thr Glu Ser Ser Ser Asp Ala He Trp 

1 5 10 15 

Thr Val Ala Ala Val Gin Ala Ala Gly Val Leu Glu Ser Gly Cys Asp 
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20 25 30 

Gly Ala Gly Arg Leu Ala Asp Cys Arg Gly Arg Gly Ala Arg Ala Arg 

35 40 45 

Leu lie Val Phe .Pro Glu Ala Phe lie Pro Ser Tyr Pro Asp Trp Ala 

50 55 60 

Trp Ala Val Pro Pro Gly Gin Ala Glu Pro Thr Arg Asp Cys Met Pro 
65 70 75 80 

Asn Cys Ser Thr He Arg Leu Arg Cys Pro Ala Gin Pro Pro Met Pro 

85 90 95 

Gly Gin Gly Gly Ser Arg Arg Gly Arg Leu Arg Arg His Gly His Lys 

100 105 110 

Arg Ala Glu His Gly Gly Ser Gly Gly Ser Leu Tyr Asn Ser Leu Leu 

115 120 125 

Tyr He Gly Pro Asp Gly Arg He Leu Gly He His Arg Lys Leu Cys 

130 135 140 

Arg Pro Arg Arg Ser Gly Ser Gly His Lys Ala Met Glu Ala Arg Leu 
145 150 155 160 

Ala Cys Ser He Arg Arg Ala Ala Gly Leu Ala Ala Ser Ala Gly Lys 

165 170 175 

Thr He Cys Arg Trp Pro Asp He Arg Cys Thr Arg Ala Val Gin He 

180 185 190 

Tyr Val Ala Ala Thr Trp Asp Arg Gly Glu Pro Trp Leu Ser Thr Leu 

195 200 205 

Arg His He Ala Lys Lys Ala Arg Pro Thr Ala Val Ala Ser Pro Cys 

210 215 220 

Gly Arg Pro Thr Ser Thr Thr Pro Ser Ser Ser Arg Ser He Thr Arg 
225 230 235 240 

Thr Pro Val Ser Gly Ser Met Lys Ala Thr Ala Arg Leu Ser He Arg 

245 250 255 

Ala Glu He He Ala Gly Pro Ala His Gin Thr Asn Glu He Leu Tyr 

260 265 270 

Ala Ala He Asp Arg Gin Lys Val Leu Glu Lys Met Asp Val Gly Arg 

275 280 285 

Gly Arg Ala Leu Arg Ala Ser Gly Arg Val Phe He Trp Arg Ser Asn 

290 295 300 

Arg Cys Gin Pro Asp Asn Asp His Glu Arg Thr Lys Arg Asp Gly Arg 
305 310 315 320 

Ala Glu Ala Gin Arg Gly Ser Arg Val Ala Thr Ala Tyr Ala Gly Val 

325 330 335 

Val Thr Leu Ala Arg Glu Phe Gly Arg Cys 
340 345 

<210> 371 
<211> 1014 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 371 

atgggtatcg aacatccgaa gtacaaggtt gcggtggtac aggcggcgcc ggcctggctc 60 

gatctcgacg catcgatcgc caagaccatc gcgctgatcg aggaggcggc cgccaagggc 120 

gccaagctga tcgcattccc tgaggccttc attcccggct atccttggca tatctggatg 180 

gactcgccgg cctgggcgat cgggcgcgga tttgtgcagc gctatttcga caattcgctt 240 

tcctacgaca gcccgcaggc cgaacggctg cggctcgcgg tgaagaaggc cggcatcacc 300 

gccgtgctcg gcctgtccga gcgggaaggc ggcagccttt atctcgcgca atggctgatc 360 

ggtcccgacg gcgagaccat cgccaagcgg cgcaagctgc ggccgacgca tgccgagcgc 420 

accgtctacg gcgaaggcga cggcagtgac cttgcggtcc atgaccgcgc tgacattggc 480 

cgtctcggcg cgctgtgctg ctgggaacat ctgcagccgc tgtcgaaata cgccatgtat 54 0 

gcccagaacg agcaggtgca tgtcgcggcc tggccgagtt tttcgctgta cgacccgttc 600 
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gcgccggcgc tgggctggga ggtcaacaac gcggcatccc gcgtctatgc ggtcgaaggc 660 

tcctgcttcg tgctggcgcc gtgtgccacc gtctcgcagg cgatggtgga cgaactctgc 720 

gaccgcgacg acaagcatgc gctgctgcat gtcggcggcg gccacgccgc gatctacgga 780 

ccggacggca gctcgatggc gaacaagctc gatcccgagc aggagggcct gctgttcgcc 840 

gacatcgatc tcggggcgat cggggtggca aagaacgccg ccgatccggc cgggcactat 900 

tcgcggccgg atgtgacccg tctgctcttg aacagaaaac cctcaaagcg cgtcgagcac 960 

tttgcgctgc cgctcgacca tctcgcggac gagggcgttg ctccggtgac ctga 1014 

<210> 372 
<211> 327 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 372 

Met Gly lie Glu His Pro Lys Tyr Lys Val Ala Val Val Gin Ala Ala 

15 10 15 

Pro Ala Trp Leu Asp Leu Asp Ala Arg Ser Pro Arg Pro Ser Arg Ser 

20 25 30 ' 

Arg Arg Arg Pro Pro Arg Ala Pro Ser Ser His Ser Leu Ala Phe lie 

35 40 45 

Pro Gly Tyr Pro Trp His lie Trp Met Asp Ser Pro Ala Trp Ala He 

50 55 60 

Gly Arg Gly Phe Val Gin Ala He Ser Thr He Arg Phe Pro Thr Thr 
65 70 75 80 

Ala Arg Arg Pro Asn Gly Cys Gly Ser Arg Arg Arg Pro His His Arg 

85 90 95 

Arg Ala Arg Pro Val Arg Ala Gly Arg Arg Gin Pro Leu Ser Arg Ala 

100 105 110 

Met Ala Asp Arg Ser Thr Ala Arg Pro Ser Pro Ser Gly Ala Ser Cys 

115 120 125 

Gly Arg Arg Met Pro Ser Ala Pro Ser Thr Ala Lys Ala Thr Ala Val 

130 135 140 

Thr Leu Arg Ser Met Thr Ala Leu Thr Leu Ala Val Ser Ala Arg Cys 
145 150 155 160 

Ala Ala Gly Asn Leu Gin Pro Leu Ser Lys Tyr Ala Met Tyr Ala Gin 

165 170 175 

Asn Glu Gin Val His Val Ala Ala Trp Pro Ser Phe Ala Val Arg Pro 

180 185 190 

Val Arg Ala Gly Ala Gly Leu Gly Gly Gin Gin Arg Gly He Pro Arg 

195 200 205 

Leu Cys Gly Arg Gly Ser Cys Phe Val Leu Ala Pro Cys Ala Thr Val 

210 215 220 

Ser Gin Ala Met Val Asp Glu Leu Cys Asp Arg Asp Gin Ala Cys Ala 
225 230 235 240 

Ala Ala Cys Arg Arg Arg Pro Arg Arg Asp Leu Arg Thr Gly Arg Gin 

245 250 255 

Leu Asp Gly Thr Ser Ser He Pro Ser Arg Arg Ala Cys Cys Ser Pro 

260 265 270 

Thr Ser He Ser Gly Arg Ser Gly Trp Gin Arg Thr Pro Pro He Arg 

275 280 285 

Pro Gly Thr He Arg Gly Arg Met Pro Val Cys Ser Thr Glu Asn Pro 

290 295 300 

Gin Ala Arg Arg Ala Leu Cys Ala Ala Ala Arg Pro Ser Arg Gly Arg 
305 310 315 320 

Gly Arg Cys Ser Gly Asp Leu 
325 

<210> 373 
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<211> 1056 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 373 

atgacgaatt ttctggacgt 
cgggaggcat cgacggagaa 
acgctcgcag cgttcggcga 
gcgcacaacc ggagcctgtt 
attccgagtc ccacaacgga 
gtcattggcg tcgttgaact 
ttcatcggtc gcgacgggac 
gagcggacga tctggggcga 
gggcggctga gcggcctgaa 
gccgcacagg gcacgcagtt 
ccgccgaacg aagcggctta 
gccggcgcgt acgtgatcag 
tatcgcgagt tagggcggtc 
ggcgaggtca ttgccgggcc 
gaagctctcc tcgcggccaa 
atctttcagc tgcgtgtcaa 
ttcacggcgg cgatcggaca 
cccttcgacc tggcggaatc 

<210> 374 
<211> 351 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 374 



Met 


Thr 


Asn 


Phe 


Leu 


Asp 


Val 


Thr 


Val 


Ala 


Ala Val Gin Ala Ala 


Pro 


1 








5 










10 


15 




Val 


Tyr 


Phe 


Asp 


Arg 


Glu 


Ala 


Ser 


Thr 


Glu 


Lys Ala Cys Arg Leu 


He 








20 










25 




30 




His 


Glu 


Ala 


Ala 


Gly 


Leu 


Gly 


Ala 


Thr 


Leu 


Ala Ala Phe Gly Glu 


Thr 






35 










40 






45 




Trp 


Leu 


Pro 


Gly 


Tyr 


Pro 


Phe 


Phe 


Val 


Trp 


Gly Phe Ala His Asn Arg 




50 










55 








60 




Ser 


Leu 


Phe 


Trp 


Gin 


Ala 


Ala 


Ala 


Glu 


Tyr 


He Ala Asn Ala Val 


Glu 


65 










70 










75 


80 


He 


Pro 


Ser 


Pro 


Thr 


Thr 


Asp 


Arg 


Leu 


Cys 


Ala Ala Ala Lys Ala Ala 










85 










90 


95 




Gly 


Val 


Asp 


Val 


Val 


He 


Gly 


Val 


Val 


Glu 


Leu Asp Glu Arg Thr Arg 








100 










105 




110 




Ala 


Ser 


Val 


Tyr 


Ser 


Thr 


Leu 


Leu 


Phe 


He 


Gly Arg Asp Gly Thr 


He 






115 










120 






125 




Leu 


Gly 


Arg 


His 


Arg 


Lys 


Leu 


Lys 


Pro 


Thr 


His Met Glu Arg Thr 


He 




130 










135 








140 




Trp 


Gly 


Glu 


Gly 


Asp 


Ala 


Tyr 


Gly 


Leu 


Arg 


Val Tyr Glu Arg Ser 


Tyr 


145 










150 










155 


160 


Gly 


Arg 


Leu 


Ser 


Gly 


Leu 


Asn 


Cys 


Trp 


Glu 


His Asn Met Met Leu 


Pro 










165 










170 


175 




Gly 


Tyr 


Val 


Leu 


Ala 


Ala 


Gin 


Gly 


Thr 


Gin 


Phe His Val Ala Ala 


Trp 








180 










185 




190 




Pro 


Gly 


Lys 


Glu 


Arg 


Leu 


Thr 


Val 


Pro 


Pro 


Asn Glu Ala Ala Tyr 


Thr 



gacggtggca gcggttcagg cggctcccgt ctatttcgat 60 

agcgtgccgg ttgatccacg aagcggcagg gctcggcgcg 120 

aacctggttg ccagggtatc cgttcttcgt ctgggggttc 180 

ctggcaggcc gccgccgagt acatcgccaa tgcggtggag 240 

ccgtctctgt gcggcggcga aggctgccgg ggtcgacgtc 300 

ggatgaacga acacgagctt cggtttacag tacgctgctt 360 

gatcctgggc cgccaccgca agctgaagcc aacacacatg 420 

aggggacgca tatggactcc gcgtctacga acgttcgtac 480 

ttgctgggaa cacaatatga tgctgcccgg ctacgtgctt 540 

tcacgtcgcc gcatggcccg gaaaggagag gctcaccgtc 600 

tacgcgccag cttctcctct ctcgcgcgta tgcatcccag 660 

cgtcgccggg ctgctcgcac cagactccat gcccgagcgt 720 

atatgagttg accggcgaca gcgtcatcgt cgacccgcgc 780 

tgcaaaaggc gagaccatcc tgctcgcgca gtgcagtcag 840 

gtccgccatc gacctcggcg gccattactc acgcccggat 900 

cgatcaactg cagcatcggg tccggagagt tgagccacac 960 

tatcggagcc gagcgccgat cccaggagga tggtactggt 1020 

tctcacgaac tcctag 1056 
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195 


200 


Arg 


Gin 


Leu 


Leu Leu Ser Arg Ala 




210 




215 


Val 


He 


Ser 


Val Ala Gly Leu Leu 


225 






230 


Tyr 


Arg 


Glu 


Leu Gly Arg Ser Tyr 








245 


Val 


Asp 


Pro 


Arg Gly Glu Val He 








260 


He 


Leu 


Leu 


Ala Gin Cys Ser Gin 






275 


280 


Ala 


He 


Asp 


Leu Gly Gly His Tyr 




290 




295 


Arg 


Val 


Asn 


Asp Gin Leu Gin His 


305 






310 


Phe 


Thr 


Ala 


Ala He Gly His He 








325 


Asp 


Gly 


Thr 


Gly Pro Phe Asp Leu 



340 







205 








Tyr 


Ala 


Ser Gin Ala 


Gly Ala 


Tyr 






220 








Ala 


Pro 


Asp Ser Met 


Pro 


Glu 


Arg 






235 






240 


Glu 


Leu 


Thr Gly Asp 


Ser 


Val 


He 




250 






255 




Ala 


Gly 


Pro Ala Lys 


Gly Glu 


Thr 


265 






270 






Glu 


Ala 


Leu Leu Ala 


Ala 


Lys 


Ser 






285 








Ser 


Arg 


Pro Asp He 


Phe 


Gin 


Leu 






300 








Arg 


Val 


Arg Arg Val 


Glu 


Pro 


His 






315 






320 


Gly 


Ala 


Glu Arg Arg 


Ser 


Gin 


Glu 




330 






335 




Ala 


Glu 


Ser Leu Thr 


Asn 


Ser 




345 






350 







<210> 375 
<211> 939 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 375 

atgaccaaaa tcgctgtcat tcaagaaccc ccggtctatc tgaatctgag caaatcgatg 60 

gaccgagcgg tcgacttgat tgccgatgcc gcgagccagg ggtgtcagtt gattgtgttt 120 

cccgaagcct ggcttgcagg ttaccccacc ttcgtctggc gtcttgcgcc gggcagcgga 180 

atgggaaaaa ccgatgagct ttacgcgcgt ttgctcgcca actcggtcga ccgcagcaaa 240 

gaggggcttc ggcctttgca ggaggctgca aaagagcatg gcgttgtcat tgttctgggt 300 

tatcaagagg tggacggctc gggcagcagc agcacaatct tcaacagctg cgcgattatt 360 

gatgccgacg ggcgactggc caacaaccat cgcaagttga tgcccaccaa tgcggagcgg 420 

atggtgtggg ggtttggcga cggttcgggc ctgaacgttg ttgacaccgc ggtgggcagg 480 

atcggcacgc tgatttgctg ggaaaactac atgcccttgg cgcgctacgc gctgtatgcc 540 

caaaacatcg aaatctatgt ggcgccgacc tgggacagcg gtgccatgtg gcaagccacc 600 

ctgcaacata tcgcacgtga aggtggctgc tgggtcatcg gatgtgcaac ctcgctgcaa 660 

gcctctgaca tcccggacga ccttccccat cgggatgagt tattcccgaa caaagacgaa 720 

tgggtgaacc ctggcgatgc ggtggtttac aaaccttttg gcggccttgt ggccggcccc 780 

atgcatcagg aaaaggggct tctcatcgca gagttggacg tcgccgctgt tcaggtctca 840 

cgtcggaagt tcgatgcgac cgggcattac gctcgccccg atgtcttcca actgcacgtg 900 

aatcgcagcg cgatgcggcc ggttgagttc acgaattag 939 

<210> 376 
<211> 312 
<212> PRT 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 376 

Met Thr Lys He Ala Val He Gin 

1 5 
Ser Lys Ser Met Asp Arg Ala Val 
20 

Gin Gly Cys Gin Leu He Val Phe 
35 40 



Glu Pro Pro Val Tyr Leu Asn Leu 

10 15 
Asp Leu He Ala Asp Ala Ala Ser 
25 30 
Pro Glu Ala Trp Leu Ala Gly Tyr 
45 
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Pro 


Thr 


Phe 


Val 


Trp 


Arg 


Leu 


Ala 




50 










55 




Asp 


Glu 


Leu 


Tyr Ala Arg 


Leu 


Leu 


65 










70 






Glu 


Gly 


Leu 


Arg 


Pro 


Leu 


Gin 


Glu 










85 








lie 


Val 


Leu 


Gly Tyr 


Gin 


Glu 


Val 








100 










lie 


Phe 


Asn 


Ser 


Cys 


Ala 


He 


He 






115 










120 


Asn 


His 


Arg 


Lys 


Leu 


Met 


Pro 


Thr 




130 










135 




Phe 


Gly 


Asp 


Gly Ser Gly 


Leu 


Asn 


145 










150 






lie 


Gly 


Thr 


Leu 


He 


Cys 


Trp Glu 










165 








Ala 


Leu 


Tyr 


Ala 


Gin 


Asn 


He 


Glu 








180 










Ser 


Gly 


Ala 


Met 


Trp 


Gin 


Ala 


Thr 






195 










200 


Gly 


Cys 


Trp 


Val 


He 


Gly 


Cys Ala 




210 










215 




Pro 


Asp 


Asp 


Leu 


Pro 


His 


Arg Asp 


225 










230 






Trp 


Val 


Asn 


Pro Gly Asp 


Ala 


Val 










245 








Val 


Ala 


Gly 


Pro 


Met 


His 


Gin 


Glu 








260 










Asp 


Val 


Ala 


Ala 


Val 


Gin 


Val 


Ser 






275 










280 


His 


Tyr 


Ala 


Arg 


Pro Asp 


Val 


Phe 




290 










295 




Met 


Arg 


Pro 


Val 


Glu 


Phe 


Thr 


Asn 


305 










310 







<210> 377 
<211> 1050 
<212> DNA 
<213> Unknown 



Pro Gly Ser Gly 


Met 


Gly 


Lys 


Thr 


60 










Ala Asn Ser Val 


Asp 


Arg 


Ser 


Lys 


75 








80 


Ala Ala Lys Glu 


His 


Gly 


Val 


Val 


90 






95 




Asp Gly Ser Gly 


Ser 


Ser 


Ser 


Thr 


105 




110 






Asp Ala Asp Gly 


Arg 


Leu 


Ala 


Asn 




125 








Asn Ala Glu Arg 


Met 


Val 


Trp 


Gly 


140 










Val Val Asp Thr 


Ala 


Val 


Gly 


Arg 


155 








160 


Asn Tyr Met Pro 


Leu 


Ala 


Arg 


Tyr 


170 






175 




lie Tyr Val Ala 


Pro 


Thr 


Trp 


Asp 


185 




190 






Leu Gin His He 


Ala 


Arg 


Glu 


Gly 




205 








Thr Ser Leu Gin 


Ala 


Ser 


Asp 


He 


220 










Glu Leu Phe Pro 


Asn 


Lys 


Asp 


Glu 


235 








240 


Val Tyr Lys Pro 


Phe 


Gly 


Gly 


Leu 


250 






255 




Lys Gly Leu Leu 


He 


Ala 


Glu 


Leu 


265 




270 






Arg Arg Lys Phe 


Asp 


Ala 


Thr 


Gly 




285 








Gin Leu His Val 


Asn 


Arg 


Ser 


Ala 



300 



<220> 

<223> Obtained from an environmental sample 



<400> 377 

atgtccagca ccacccatcc ccgcctgcgc gtcgccgcgg tgcaagccgc ccccgtcttc 60 

ctcgacctcg acggcaccat cgacaagacc atcgacctga tggcccgggc ggccggccag 120 

ggcgtgcagc tgatcgcctt tcccgagacc tgggtgcccg gctatccgtg gtggatctgg 180 

ctcgattcgc cggcctgggg catgcagttc gtgcagcgct accacgacaa cgccctggtc 240 

gtcggctcgc ccgagttcga ccgcattcgc gaggccgcgc gcaagcaccg catctgggtc 300 

tcgctcggct acagcgagaa ggccgccggc agcctctaca tcgcccaggc gctgatcgac 360 

gaccagggca acacgctgca gactcggcgc aagctcaagc cgacgcacgt ggagcgcacc 420 

gtgttcggcg agggcgacgg atcggacctg agcgtggtcg agacggctat cggcaacatc 480 

ggctcgctgt cgtgctggga gcacctgcag ccgctcagca agtacgcgat gtactcgcag 540 

aacgagcaga tccattgcgg cgcctggccc agcttctcgc tctaccgcgg cggcgcctac 600 

gcgctcggcg ccgaagtgaa caacgccgcc agccaggtgt acgcggccga gggccagtgc 660 

ttcgtgatcg cgccctgcgc cacggtctcg aaggcgatgc acgaactgct gtgcaccgac 720 

cctggcaagc agcagatgct gctggtcggc ggcggcttcg cgcgcatcta cggacccgac 780 

ggatcgccgc tcggcaagaa cctggcagag gacgaggaag ggctggtggt ggccgacatc 84 0 

gacctcggca tgatctccct ggccaaggcg gccggcgacc cggccggcca ctattcgcgg 900 

cccgacgtga cgcagttgct gttcaacagg aagcggcgcg agccggtggt gctgcaaggc 960 

ccggccgagc ccgagaaggc ggtcgccgag ccggtgtcca cgccgagcga agcggcggcg 1020 
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gccgccgccc gccagccggt cgtggcctga 1050 

<210> 378 
<211> 349 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 378 



Met 


Ser 


Ser 


Thr Thr His 


Pro 


Arg Leu 


Arg 


Val Ala 


Ala Val Gin 


Ala 


1 






5 






10 




15 




Ala 


Pro 


Val 


Phe Leu Asp 


Leu 


Asp Gly 


Thr 


He Asp 


Lys Thr He 


Asp 








20 




25 






30 


Leu 


Met 


Ala 


Arg Ala Ala Gly 


Gin Gly 


Val 


Gin Leu 


He Ala Phe 


Pro 






35 






40 






45 




Glu 


Thr 


Trp 


Val Pro Gly Tyr 


Pro Trp 


Trp 


He Trp 


Leu Asp Ser 


Pro 




50 






55 






60 




Ala 


Trp 


Gly 


Met Gin Phe 


Val 


Gin Arg 


Tyr 


His Asp 


Asn Ala Leu 


Val 


65 






70 








75 




80 


Val 


Gly 


Ser 


Pro Glu Phe Asp 


Arg He 


Arg 


Glu Ala 


Ala Arg Lys His 








85 






90 




95 




Arg 


lie 


Trp 


Val Ser Leu Gly 


Tyr Ser 


Glu 


Lys Ala 


Ala Gly Ser Leu 








100 




105 






110 




Tyr 


lie 


Ala 


Gin Ala Leu 


He 


Asp Asp 


Gin 


Gly Asn 


Thr Leu Gin 


Thr 






115 






120 






125 




Arg 


Arg 


Lys 


Leu Lys Pro 


Thr 


His Val 


Glu 


Arg Thr 


Val Phe Gly Glu 




130 






135 






140 






Gly 


Asp 


Gly 


Ser Asp Leu 


Ser 


Val Val 


Glu 


Thr Ala 


He Gly Asn 


He 


145 






150 








155 


160 


Gly 


Ser 


Leu 


Ser Cys Trp 


Glu 


His Leu 


Gin 


Pro Leu 


Ser Lys Tyr 


Ala 








165 






170 




175 




Met 


Tyr 


Ser 


Gin Asn Glu 


Gin 


He His 


Cys 


Gly Ala 


Trp Pro Ser 


Phe 








180 




185 






190 




Ser 


Leu 


Tyr 


Arg Gly Gly Ala 


Tyr Ala 


Leu 


Gly Ala 


Glu Val Asn 


Asn 






195 






200 






205 




Ala 


Ala 


Ser 


Gin Val Tyr Ala 


Ala Glu 


Gly 


Gin Cys 


Phe Val He 


Ala 




210 






215 






220 






Pro 


Cys 


Ala 


Thr Val Ser Lys 


Ala Met 


His 


Glu Leu 


Leu Cys Thr Asp 


225 






230 








235 




240 


Pro 


Gly 


Lys 


Gin Gin Met 


Leu 


Leu Val 


Gly 


Gly Gly 


Phe Ala Arg 


He 








245 






250 




255 




Tyr 


Gly 


Pro 


Asp Gly Ser 


Pro 


Leu Gly 


Lys 


Asn Leu 


Ala Glu Asp Glu 








260 




265 






270 




Glu 


Gly 


Leu 


Val Val Ala 


Asp 


He Asp 


Leu 


Gly Met 


He Ser Leu 


Ala 






275 






280 






285 




Lys 


Ala 


Ala 


Gly Asp Pro 


Ala 


Gly His 


Tyr 


Ser Arg 


Pro Asp Val 


Thr 




290 






295 






300 






Gin 


Leu 


Leu 


Phe Asn Arg 


Lys 


Arg Arg 


Glu 


Pro Val 


Val Leu Gin 


Gly 


305 






310 








315 




320 


Pro 


Ala 


Glu 


Pro Glu Lys 


Ala 


Val Ala 


Glu 


Pro Val 


Ser Thr Pro 


Ser 








325 






330 




335 




Glu 


Ala 


Ala 


Ala Ala Ala 


Ala 


Arg Gin 


Pro 


Val Val 


Ala 










340 




345 











<210> 379 
<211> 936 
<212> DNA 
<213> Unknown 
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<220> 

<223> Obtained from an environmental sample 
<400> 379 

atggctaaaa tagcgattgt acaaaaggcg tctgttacct tgaataaaca agaaactgtc 60 

gccagtgctg taagagaggt agagctggca gcggcggaag gtgccgagct ggtagtgttt 120 

actgaggcat ttattgctgg gtatccggcc tggatctggc gtttgcggcc aggtggtgac 180 

tgggggctat ctgaagatct tcattcccgt ttgttgacaa gcgccgtaga cctgggtggt 240 

gatgacctgg atccacttta tgccgcagct aaagaaaata acgtgacgat agtgtgcggt 300 

attaatgaac gtgataaccg gctcagtaag gcaacgctat ataattctat cgttattatt 360 

ggttccgatg gttcattgtt aaatcgacat cgtaagttga tgccgacgaa tccggagaga 420 

atggtatggg gctttggtga tgcctctggt ctgaaggtcg ttgatacccc cgttggtcgt 480 

gttggtacgc ttgtctgttg ggaaaactat atgcccttgg ccagatatgc gttgtattcg 540 

cagggggtag aggtttatat tgcgccgacc tacgatagcg gtgatgactg gatttctaca 600 

ttacagcata ttgccaggga gggtcgttgt tgggttgttg gctgtggcaa tctattgcgt 660 

ggcagcgata taccggatga cttccctgag aagttggcgt tatacccaga tgaggatgag 720 

tggataaatc ctggggattc cgttgtgatt gcacctgggg gtaaaatcat ggccgggcca 780 

ttgcgccagg aggcggggat tgtctattgt gatattgcgt ctgaaagtgc cagtcaggca 840 

aaacgtgcgc tggatgtggc tggacattat tcccggcctg atatctttga gttgcatgtg 900 

aatacgaagg tgcagacccc ggttgtatat gattag 936 

<210> 380 

<211> 311 

<212> PRT 

<213> Unknown 

<220> 

<223> Obtained from an environmental sample 



<400> 380 



Met Ala 


Lys 


lie 


Ala He 


Val 


Gin 


Lys 


Ala 


Ser Val Thr Leu Asn 


Lys 


1 




5 








10 


15 




Gin Glu 


Thr 


Val 


Ala Ser 


Ala 


Val 


Arg 


Glu 


Val Glu Leu Ala Ala 


Ala 






20 








25 




30 




Glu Gly 


Ala 


Glu 


Leu Val 


Val 


Phe 


Thr 


Glu 


Ala Phe He Ala Gly Tyr 


35 








40 






45 




Pro Ala 


Trp 


lie 


Trp Arg 


Leu 


Arg 


Pro 


Gly 


Gly Asp Trp Gly Leu 


Ser 


50 








55 








60 




Glu Asp 


Leu 


His 


Ser Arg 


Leu 


Leu 


Thr 


Ser 


Ala Val Asp Leu Gly Gly 


65 






70 










75 


80 


Asp Asp 


Leu 


Asp 


Pro Leu 


Tyr 


Ala 


Ala 


Ala 


Lys Glu Asn Asn Val 


Thr 








85 








90 


95 




lie Val 


Cys 


Gly 


He Asn 


Glu 


Arg 


Asp 


Asn 


Arg Leu Ser Lys Ala 


Thr 






100 








105 




110 




Leu Tyr 


Asn 


Ser 


He Val 


He 


He 


Gly 


Ser 


Asp Gly Ser Leu Leu 


Asn 


115 








120 






125 




Arg His 


Arg 


Lys 


Leu Met 


Pro 


Thr 


Asn 


Pro 


Glu Arg Met Val Trp 


Gly 


130 








135 








140 




Phe Gly 


Asp 


Ala 


Ser Gly 


Leu 


Lys 


Val 


Val 


Asp Thr Pro Val Gly Arg 


145 






150 










155 


160 


Val Gly 


Thr 


Leu 


Val Cys 


Trp 


Glu 


Asn 


Tyr 


Met Pro Leu Ala Arg 


Tyr 






165 








170 


175 




Ala Leu 


Tyr 


Ser 


Gin Gly 


Val 


Glu 


Val 


Tyr 


He Ala Pro Thr Tyr 


Asp 






180 








185 




190 




Ser Gly 


Asp 


Asp 


Trp He 


Ser 


Thr 


Leu 


Gin 


His He Ala Arg Glu Gly 


195 








200 






205 




Arg Cys 


Trp 


Val 


Val Gly 


Cys 


Gly 


Asn 


Leu 


Leu Arg Gly Ser Asp 


He 


210 








215 








220 




Pro Asp 


Asp 


Phe 


Pro Glu 


Lys 


Leu 


Ala 


Leu 


Tyr Pro Asp Glu Asp 


Glu 


225 






230 










235 


240 


Trp lie 


Asn 


Pro 


Gly Asp 


Ser 


Val 


Val 


He 


Ala Pro Gly Gly Lys 


He 
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245 250 255 

Met Ala Gly Pro Leu Arg Gin Glu Ala Gly He Val Tyr Cys Asp He 

260 265 270 

Ala Ser Glu Ser Ala Ser Gin Ala Lys Arg Ala Leu Asp Val Ala Gly 

275 280 285 

His Tyr Ser Arg Pro Asp He Phe Glu Leu His Val Asn Thr Lys Val 

290 295 300 

Gin Thr Pro Val Val Tyr Asp 
305 310 

<210> 381 

<211> 945 

<212> DNA 

<213> Clostridium acetobutylicum ATCC 3625 



<400> 381 

atgggcgaat 

cgcgaggcct 

gatcttctgg 

tggtctcgcg 

ggtccggaaa 

ggagtagtag 

agccgcgagg 

cgttactggt 

cttagcggcc 

cagggcaccc 

ctgtctcgtg 

ccggccccag 

gagtcatgta 

acccttatca 

gtgggcggtc 

ccccgtgtcg 



tcggtgaagt 
cgaccgagaa 
cgttcggtga 
aatacaacga 
cggacgctct 
aactggagcc 
gcgagatcct 
ctgaaggcga 
tgaattgctg 
agttccatgt 
cctacgctat 
gtgaactgcc 
tcatcgaccc 
cggctcgcgt 
attatagccg 
tgtttcgcga 



taccctgggc 
agcctgtggc 
aacgtggtta 
cttgcgtgca 
gtgccaagca 
gggctctctt 
gggtcgtcac 
cgcgactggt 
ggagcacact 
ggccgcttgg 
gcaggcgggc 
ggatggtatc 
gtgggggaaa 
tagcaccgca 
cccggatgtt 
tggtgacgtg 



gtggctcagg 
ctgatccgcg 
accggctatc 
cgttatgttg 
gccgcggaag 
tcctcggttt 
cgcaaactga 
ctgcgcgttt 
atgatgctgc 
ccaaacatgg 
tgctacgttt 
gcggcggaaa 
gtaattgctg 
tctatttatc 
ttccgttttg 
gatgaccgag 



ctgctcccgt 
aggcgggcga 
catactggaa 
cgaacggtgt 
caggtgtgga 
attgcactct 
aaccgaccga 
atgagcgccc 
cggggtacgc 
catcctcgaa 
tatgcgcggg 
gtttagatca 
gtccggtgtc 
gccgcaaaag 
aagtcgatcg 
gttaa 



gtactttgac 
aaagggtgta 
agatgcgcct 
gatgattcct 
tgtggcgatc 
gttatttata 
tagcgaacgt 
atatggtcgg 
cctggcggcg 
ttctgaactt 
tggcctgggc 
cttgactgga 
ttgcgaggaa 
tttgacggac 
ctctgagcgt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
945 



<210> 382 
<211> 314 
<212> PRT 

<213> .Clostridium acetobutylicum ATCC 3625 



<400> 382 

Met Gly Glu Phe Gly Glu Val Thr Leu Gly Val Ala 

15 10 
Val Tyr Phe Asp Arg Glu Ala Ser Thr Glu Lys Ala 

20 25 
Arg Glu Ala Gly Glu Lys Gly Val Asp Leu Leu Ala 

35 40 
Trp Leu Thr Gly Tyr Pro Tyr Trp Lys Asp Ala Pro 

50 55 60 

Tyr Asn Asp Leu Arg Ala Arg Tyr Val Ala Asn Gly 
65 70 75 

Gly Pro Glu Thr Asp Ala Leu Cys Gin Ala Ala Ala 

85 90 
Asp Val Ala He Gly Val Val Glu Leu Glu Pro Gly 

100 105 
Val Tyr Cys Thr Leu Leu Phe He Ser Arg Glu Gly 

115 120 
Arg His Arg Lys Leu Lys Pro Thr Asp Ser Glu Arg 
130 135 140 

Glu Gly Asp Ala Thr Gly Leu Arg Val Tyr Glu Arg 
145 150 155 

Leu Ser Gly Leu Asn Cys Trp Glu His Thr Met Met 
165 170 



Gin Ala Ala Pro 
15 

Cys Gly Leu He 
30 

Phe Gly Glu Thr 
45 

Trp Ser Arg Glu 

Val Met He Pro 
80 

Glu Ala Gly Val 
95 

Ser Leu Ser Ser 
110 

Glu He Leu Gly 
125 

Arg Tyr Trp Ser 

Pro Tyr Gly Arg 
160 

Leu Pro Gly Tyr 
175 
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Ala 


Leu 


Ala 


Ala Gin Gly 
180 


Thr 


Gin 


Phe 
185 


His 


Val 


Ala 


Ala 


Trp 
190 


Pro 


Asn 


Met 


Ala 


Ser 
195 


Ser Asn Ser 


Glu 


Leu 
200 


Leu 


Ser 


Arg 


Ala 


Tyr 
205 


Ala 


Met 


Gin 


Ala 


Gly 
210 


Cys 


Tyr Val Leu 


Cys 
215 


Ala 


Gly 


Gly 


Leu 


Gly 
220 


Pro 


Ala 


Pro 


Gly 


Glu 


Leu 


Pro 


Asp Gly He 


Ala 


Ala 


Glu 


Ser 


Leu 


Asp 


His 


Leu 


Thr 


Gly 


225 






230 










235 










240 


Glu 


Ser 


Cys 


He He Asp 
245 


Pro 


Trp 


Gly 


Lys 
250 


Val 


He 


Ala 


Gly 


Pro 
255 


Val 


Ser 


Cys 


Glu 


Glu Thr Leu 
260 


He 


Thr 


Ala 
265 


Arg 


Val 


Ser 


Thr 


Ala 
270 


Ser 


He 


Tyr 


Arg 


Arg 
275 


Lys Ser Leu 


Thr 


Asp 
280 


Val 


Gly 


Gly 


His 


Tyr 
285 


Ser 


Arg 


Pro 


Asp 


Val 


Phe 


Arg Phe Glu 


Val 


Asp 


Arg 


Ser 


Glu 


Arg 


Pro 


Arg 


Val 


Val 




290 






295 










300 








Phe 


Arg 


Asp 


Gly Asp Val 


Asp 


Asp 


Arg 


Gly 














305 






310 























<210> 383 
<211> 1041 
<212> DNA 
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 



<400> 383 

atgtcggagc ccatgacgaa gtatcgcggc gcggcggtgc aggccgcgcc ggtgttcctc 60 

gatctcgacc gcacagtcga gaaagcgatc ggcctgatcg agcaggcggc caagcaggac 120 

gtgcgcctga tcgcattccc agagacttgg attcccggct atcccttttg gatatggctg 180 

ggcgcgccgg cttggggcat gcgcttcgtc cagcgctatt tcgagaattc gctcgtgcgc 240 

ggcagcaagc agtggcaggc cctggcggat gcggcccgcc gccacggcat gcatgtcgtg 300 

gccggctata gcgagcgcgc gggcggcagc ctctatatgg gccaggcgat cttcggcccc 360 

gatggcgatc tgatcgccgc gcgccgcaag ctcaagccta cccatgcgga gcgcaccgtg 420 

ttcggcgagg gagacggcag ccatctcgcg gtgcacgata ccgccatcgg gcgcctcggc 480 

gcgctctgtt gctgggagca catccagcca ttgtqgaaat acgccatgta cgccgccgac 540 

gaacaggtcc acgtcgcgtc gtggccgagc ttcagcctct atcgcggcat ggcctatgcg 600 

ctcggaccgg aggtcaatac cgccgcaagc cagatctacg cggtcgaggg cggctgctac 660 

gtgctggcgt cgtgcgcgac cgtttcgccg gagatgatca aggtattggt ggatacgccc 720 

gacaaggaga tgttcctcaa ggccggcggc ggttttgcca tgattttcgg gcccgacggc 780 

cgcgccctgg ccgagccgct cccggagacc gaagagggac tgctggtcgc cgatatcgac 840 

ctcggcatga tcgcgttggc caaggcggcg gccgatccgg cgggccacta ttcacggccc 900 

gacgtaacgc ggctgctgct ggatcgacgt ccggcccaac gcgtcgtcac gcttgatgcc 960 

gcattcgaac cgcaaaacga ggacaagggc gacgcgcccg cgctgcgcgt ggtggcggaa 1020 

agcgccgccg ccgcgcagta g 1041 

<210> 384 
<211> 346 
<212> PRT 
•<213> Unknown 



<220> 

<223> Obtained from an environmental sample 
<400> 384 

Met Ser Glu Pro Met Thr Lys Tyr Arg Gly Ala Ala Val Gin Ala Ala 

15 10 15 

Pro Val Phe Leu Asp Leu Asp Arg Thr Val Glu Lys Ala He Gly Leu 

20 25 30 

He Glu Gin- Ala Ala Lys Gin Asp Val Arg Leu lie Ala Phe Pro Glu 
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35 40 45 

Thr Trp lie Pro Gly Tyr Pro Phe Trp lie Trp Leu Gly Ala Pro Ala 

50 55 60 

Trp Gly Met Arg Phe Val Gin Arg Tyr Phe Glu Asn Ser Leu Val Arg 
65 70 75 80 

Gly Ser Lys Gin Trp Gin Ala Leu Ala Asp Ala Ala Arg Arg His Gly 

85 90 95 

Met His Val Val Ala Gly Tyr Ser Glu Arg Ala Gly Gly Ser Leu Tyr 

100 105 110 

Met Gly Gin Ala lie Phe Gly Pro Asp Gly Asp Leu lie Ala Ala Arg 

115 120 125 

Arg Lys Leu Lys Pro Thr His Ala Glu Arg Thr Val Phe Gly Glu Gly 

130 135 140 

Asp Gly Ser His Leu Ala Val His Asp Thr Ala He Gly Arg Leu Gly 
145 150 155 160 

Ala Leu Cys Cys Trp Glu His He Gin Pro Leu Ser Lys Tyr Ala Met 

165 170 175 

Tyr Ala Ala Asp Glu Gin Val His Val Ala Ser Trp Pro Ser Phe Ser 

180 185 190 

Leu Tyr Arg Gly Met Ala Tyr Ala Leu Gly Pro Glu Val Asn Thr Ala 

195 200 205 

Ala Ser Gin He Tyr Ala Val Glu Gly Gly Cys Tyr Val Leu Ala Ser 

210 215 220 

Cys Ala Thr Val Ser Pro Glu Met He Lys Val Leu Val Asp Thr Pro 
225 230 235 240 

Asp Lys Glu Met Phe Leu Lys Ala Gly Gly Gly Phe Ala Met He Phe 

245 250 255 

Gly Pro Asp Gly Arg Ala Leu Ala Glu Pro Leu Pro Glu Thr Glu Glu 

260 265 270 

Gly Leu Leu Val Ala Asp He Asp Leu Gly Met He Ala Leu Ala Lys 

275 280 285 

Ala Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp Val Thr Arg 

290 295 300 

Leu Leu Leu Asp Arg Arg Pro Ala Gin Arg Val Val Thr Leu Asp Ala 
305 310 315 320 

Ala Phe Glu Pro Gin Asn Glu Asp Lys Gly Asp Ala Pro Ala Leu Arg 

325 330 335 

Val Val Ala Glu Ser Ala Ala Ala Ala Gin 
340 345 

<210> 385 
<211> 1014 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 385 

atgaaagaag ctatcaaggt cgcctgcgtg caagccgccc cgatctacat ggatttgaag 60 

gcgacggtgg acaaaaccat tgagttgatg gaagaagcag cacgtaataa tgctcgtctg 120 

atcgcctttc cggaaacttg gattccaggc tacccatggt ttctttggct tgactcacca 180 

gcatgggcaa tgcaatttgt acgccaatac catgagaact cattggagtt ggatggccct 240 

caagctaagc gcatttcaga tgcagccaag cggttgggaa tcatggtcac cctggggatg 300 

agtgaacggg tcggtggcac cctttacatc agtcagtggt tcataggcga taatggtgac 360 

accattgggg cccggcgaaa gttgaaacct acttttgttg aacgtacttt gttcggcgaa 420 

ggggatggtt catcgctagc ggttttcgag acgtctgttg gaaggctggg tggcttatgc 480 

tgttgggagc accttcaacc gctaacaaaa tacgctttgt atgcacaaaa tgaagagatt 540 

cattgtgcgg cttggccgag ctttagcctt tatcctaatg cggcgaaagc cctggggcct 600 

gatgtcaatg tagcggcctc tcgaatctat gccgttgaag ggcaatgctt cgtactagcg 660 

tcgtgtgcgc tcgtttcaca atccatgatc gatatgcttt gtacagatga cgaaaagcat 720 
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gcgttgcttc tggctggtgg tggacactca cgtatcatag ggcctgatgg tggtgacttg 780 

gtcgcgcctc ttgccgaaaa tgaagagggt attctctacg caaaccttga tcctggagta 840 

cgcatccttg ctaaaatggc ggcagaccct gctggtcatt attcccgtcc cgacattact 900 

cgcttgctaa tagatcgcag ccctaaatta ccggtagttg aaattgaagg tgatcttcgt 960 

ccttacgctt tgggtaaagc gtctgagacg ggtgcgcaac tcgaagaaat ttga 1014 

<210> 386 
<211> 337 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 386 

Met Lys Glu Ala He Lys Val Ala Cys Val Gin Ala Ala Pro He Tyr 

15 10 15 

Met Asp Leu Lys Ala Thr Val Asp Lys Thr He Glu Leu Met Glu Glu 

20 25 30 

Ala Ala Arg Asn Asn Ala Arg Leu He Ala Phe Pro Glu Thr Trp He 

35 40 45 

Pro Gly Tyr Pro Trp Phe Leu Trp Leu Asp Ser Pro Ala Trp Ala Met 

50 55 60 

Gin Phe Val Arg Gin Tyr His Glu Asn Ser Leu Glu Leu Asp Gly Pro 
65 70 75 80 

Gin Ala Lys Arg He Ser Asp Ala Ala Lys Arg Leu Gly He Met Val 

85 90 95 

Thr Leu Gly Met Ser Glu Arg Val Gly Gly Thr Leu Tyr He Ser Gin 

100 105 110 

Trp Phe He Gly Asp Asn Gly Asp Thr He Gly Ala Arg Arg Lys Leu 

115 120 125 

Lys Pro Thr Phe Val Glu Arg Thr Leu Phe Gly Glu Gly Asp Gly Ser 

130 135 140 

Ser Leu Ala Val Phe Glu Thr Ser Val Gly Arg Leu Gly Gly Leu Cys 
145 150 155 160 

Cys Trp Glu His Leu Gin Pro Leu Thr Lys Tyr Ala Leu Tyr Ala Gin 

165 170 175 

Asn Glu Glu He His Cys Ala Ala Trp Pro Ser Phe Ser Leu Tyr Pro 

180 185 190 

Asn Ala Ala Lys Ala Leu Gly Pro Asp Val Asn Val Ala Ala Ser Arg 

195 200 205 

He Tyr Ala Val Glu Gly Gin Cys Phe Val Leu Ala Ser Cys Ala Leu 

210 215 220 

Val Ser Gin Ser Met He Asp Met Leu Cys Thr Asp Asp Glu Lys His 
225 230 235 240 

Ala Leu Leu Leu Ala Gly Gly Gly His Ser Arg He He Gly Pro Asp 

245 250 255 

Gly Gly Asp Leu Val Ala Pro Leu Ala Glu Asn Glu Glu Gly He Leu 

260 265 270 

Tyr Ala Asn Leu Asp Pro Gly Val Arg He Leu Ala Lys Met Ala Ala 

275 280 285 

Asp Pro Ala Gly His Tyr Ser Arg Pro Asp He Thr Arg Leu Leu He 

290 295 300 

Asp Arg Ser Pro Lys Leu Pro Val Val Glu He Glu Gly Asp Leu Arg 
305 310 315 320 

Pro Tyr Ala Leu Gly Lys Ala Ser Glu Thr Gly Ala Gin Leu Glu Glu 
325 330 335 

He 
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